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Field of the Invention 

The present invention relates generally to recombinant DNA technology and, 
5 more particularly, to in vitro methods for constructing and screening DNA libraries 
for DNA sequences that encode biologically active molecules. 

Background of the Invention 



1 0 recombinant DNA library can be a difficult task. The use of hybridisation probes 

may facilitate the process, but their use is generally dependent on knowing at least a 
portion of the sequence of the gene which encodes the protein. When the sequence is 
not known, DNA libraries can be expressed in an expression vector, and antibodies 
have been used to screen plaques or colonies for the desired protein antigen. This 

15 procedure has been useful in screening small libraries, but rarely occurring sequences 
which are represented in less than about 1 in 10 5 clones, as is the case with rarely 
occurring cDNA molecules or synthetic peptides, can be easily missed, making 
screening libraries larger than 10 6 clones at best laborious and difficult. Screening 
larger libraries has required the development of methods designed to address the 

20 isolation of rarely occurring sequences, which are based on the co-selection of 
molecules, along with the DNAs that encode them. In vivo methods have been 
developed to screen large libraries, such as phage display and "peptides on plasmids" 
using lad fusions of peptides. 



25 filamentous bacteriophage coat proteins and their expression in a bacterial host 

resulting in the display of foreign peptides on the surface of the phage particle with 
the DNA encoding the fusion protein packaged in the phage particle (Smith G. P., 
1985, Science 228: 1315-1317). Libraries of fusion proteins incorporated into phage, 
can then be selected for binding members against targets of interest (ligands). Bound 

30 phage can then be allowed to xewfectEscherichia coli (E. coli) bacteria and then 

amplified and the selection repeated, resulting in the enrichment of binding members 



Isolating an unknown gene which encodes a desired peptide from a 



Phage display is based on DNA libraries fused to the N-terminal end of 



2 

(Parmley, S. F., &-Smith, G. P. 1988., Gene 73: 305-318; Barrett R. W. et al, 1992, 
Analytical Biochemistry 204: 357-364 Williamson et al, Proc. Natl. Acad. Sci. USA, 

90: 4141-4145 pvlarks ^"at7l:9 9trJrtfoIrBigb32-2r58fr'5 97y. — 

Lad fusion plasmid display is based on the DNA binding ability of the lac 
5 repressor. Libraries of random peptides are fused to the C-terminal end of the lad 
repressor protein! Linkage of the Lacl-peptide fusion to its encoding DNA occurs via 
the lacO sequences on the plasmid, forming a stable peptide-LacI-peptide complex. 
These complexes are released from their host bacteria by cell lysis, and peptides of 
interest isolated by affinity purification on an immobilised receptor target. The 
1 o plasmids thus isolated can then be reintroduced into E. coli by electroporation to 

amplify the selected population for additional rounds of screening (Cull, M. G. et al. 
1992. Proc. Natl. Acad. Sci. U.S.A. 89:1865-1869). 

These bacterial methods are limited by the size of the library that can be 
created by current methods of introducing DNA into host bacteria, the potential 
15 cellular toxicity of the expressed peptides introduced, and by the inability to 

introduce post-translational modifications, or to incorporate novel amino acids into 
the expressed peptide. 

An entirely in vitro ribosome system has been described based on the linkage 
of peptides to the RNA encoding them through the ribosome (W09 1/05058). 
20 Ribosome display has also been used for the selection of single-chain Fv antibody 
fragments (scFv) (Matheakis, L. C. et al., 1994 Proc. Natl. Acad. Sci. USA, 91: 
9022-9026; Hanes, J. & Pluckthun, A. 1997 Proc. Natl. Acad. Sci. USA, 94: 4937- 
4942). This method suffers from the lower stability of the RNA genetic material and 
the increased degradation likely under certain selection conditions where RNAse is 
25 likely to be present. 

The in vitro method described by Griffiths and Tawfik (WO 99/02671 and 
WO 00/40712) addresses some of these concerns by compartmentalizing DNA prior 
to expression of peptides, which then modify the DNA within the compartment. 
Peptides capable of modifications, resulting from enzymatic activity of interest, are 
30 then selected in a subsequent step. However, no direct selection of peptide binding 
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activity is possible of both peptide and DNA without modification of the DNA 
encoding that peptide, and by releasing the modified DNA from the compartment. 

Another prior art method, covalent display technology, or CDT, is described 
in W098371 86. This method relies on covalent linkage of protein to DNA to retain 

5 the linkage of genotype to phenotype, through the cis action of the crosslinking 

protein. This method teaches that two requirements are needed for successful use of 
this technique. Firstly, proteins are required which interact in vitro with the DNA 
sequence which encodes- them (cis action), and secondly, said proteins must establish 
a covalent linkage to their own DNA template. This method suffers from the fact that 

10 the DNA is chemically modified which can prevent the recovery and identification of 
the binding peptide of interest. 

There remains a need for more versatile in vitro methods of constructing 
peptide libraries in addition to the methods described above, which can allow direct 
selection of binding activity, as well as for enzymatic activity, and that allow 

1 5 efficient production of complex peptide structures, while still allowing recovery of 
intact genetic material encoding the peptide of interest. 

Summary of the invention 

The present invention therefore provides a method for producing an in vitro 
20 peptide expression library comprising a plurality of peptides, wherein each peptide is 
linked to a DNA construct encoding the peptide, comprising the steps of: 

(a) providing a DNA construct comprising: 

(i) a DNA target sequence; 

(ii) DNA encoding a library member peptide; and 

25 (iii) DNA encoding a peptide capable of non-covalently binding 

directly or indirectly to said DNA target sequence of (ii); 
wherein said DNA construct and encoded protein are selected to have 
cis-activity; 

(b) expressing a plurality of DNA constructs according to (a), wherein 
30 said DNA constructs encode a plurality of library member peptides 
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such that each expressed peptide is non-covalently linked to the DN A 
from which it was produced. 

- - : -^o-povide-d iraTnefaHHona^^ 

comprising a plurality of peptides, wherein each peptide is linked to the DNA 
construct encoding the peptide, comprising the steps of: 
(a) providing a DNA construct comprising: 

(i) DNA encoding a library member peptide; and 

(ii) DNA encoding a peptide capable of non-covalently binding to 
a bifunctional agent; 

) wherein said DNA construct and encoded protein are selected to have 

cis-activity; 

(b) binding a bifunctional agent or a DNA tag capable of binding a 
bifunctional agent to said DNA construct of (a), wherein said 
bifunctional agent is capable of binding to the peptide encoded by said 

5 DNA of (ii); and 

(c) expressing a plurality of DNA constrcuts according to (b), wherein 
said DNA constructs encode a plurality of library member peptides 
such that each expressed peptide is linked via said bifunctional agent 
to the DNA from which it was produced. 

20 The present invention extends to the peptide libraries produced by such 

methods and to the DNA constructs used in such methods. 

The present invention also provides methods of screening in vitro peptide 
expression libraries of the invention. In one aspect there is provided a method of 
identifying and/or purifying a peptide exhibiting desired properties from an in vitro 
25 peptide expression library produced according to the method of any one of the 

preceding claims, comprising at least the steps of (a) screening said library and (b) 
selecting and isolating the relevant library member. In a second aspect, there is 
provided a method of identifying a specific ligand binding peptide, said method 
comprising at least the steps of (a) screening an in vitro peptide expression library 
30 produced according to the method of the invention with ligand molecules which are 
optionally bound to a. solid support; (b) selecting and isolating a library member 
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binding to said target molecule; and (c) isolating the peptide which binds specifically 
to said target molecule. In a third aspect there is provided a method of identifying 
and/or purifying a peptide having the ability to bind a specific DNA target sequence 
comprising at least the steps of (a) providing an in vitro expression library according 
5 to the invention wherein said peptide or protein of (iii) is a library member peptide 
having DNA binding activity and wherein said DNA target sequence of (i) is the 
target sequence of interest; (b) selecting and isolating a library member in which the 
encoded protein binds to said target sequence; (c) isolating the peptide which binds to 
said target sequence. 

10 In addition to isolating and/or identifying specific peptides from the libraries 

of the invention, the screening methods of the invention may be used to isolate 
and/or identify the DNA encoding specific peptides from the library. 

Brief Description of the Figures 

15 Figure 1 gives a schematic representation of a method by which a DNA 

construct of the invention may be linked to the peptide that it encodes. 

Figure 2 give a schematic representation of a method of the invention by 
which a DNA binding protein may be converted to- a cis-acting DNA binding protein. 
Figure 3 gives a schematic representation of how a target sequence specific 
20 DNA binding protein may be isolated from a library of the invention. ■ 

Figure 4 gives a schematic representation of how a library protein may be 
linked to its coding DNA through cis action and the use of a bi-specific binding 
molecule. 

Figure 5 shows the specificty of anti-V5 antibody binding clones. ELISA 
25 screening, read at 450nM, of the seven clones (1-7) that show specific binding to 

anti-V5 antibody. The bars in group of four represent the ELISA signal of the clones 
screened against from left to right; anti-human kappa region antibody; anti-V5 
antibody, BSA, and blank. A negative control that neither express CK nor V5 is also 
presented (8). 
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Brief Description of the Sequences 

SEQ ID Nos 1 to 1 1 , 19 to 23, 26 and 27 show the primers used in the 

— .- "Examples; r — . '■ r 

SEQ ID NO: 12 shows the sequence of the TAC-MYC-CK-REPA-CIS-ORI 
5 construct, SEQ ID NO: 13 shows the sequence of the TAC-MYC-V5-REPA-CIS- 
OPJ construct, SEQ ID NO: 24 shows the sequence of the TAC- V5 -REP A-CIS-ORI- 
408 construct and SEQ ID NO: 25 shows the sequence of the TAC-NNB-REPA-CIS- 
ORI-408 construct. 

SEQ ID NO: 14 shows the estrogen receptor target recognition sequence. 
1 o SEQ ID Nos 1 5 and 1 6 show the DN A and amino acid sequences of the repA 

gene from the Rl plasmid of the incFII incompatibility group. SEQ ID Nos 17 and 
18 show the sequences of the CIS DNA element and ori sequence form the same 
system. 

15 Detailed Description of the Invention 

The present invention relates to the construction and screening of a library for 
a nucleotide sequence which encodes a peptide of interest in vitro. The constructs 
encoding the peptide of interest are designed such that the expressed peptide shows 
cis activity for the construct. Cis activity is defined as the ability of the peptide to 
20 bind to the DNA from which the peptide was produced, i.e. from which it was 

transcribed and translated. In vitro expression of the construct results in binding of 
the peptide to the DNA encoding that same peptide molecule by non-covalent 
interaction. This differs from. the teaching of WO 98/37186, which does not allow for 
the possibility of in vitro non-covalent interaction between protein and the DNA it 
25 encodes, and indeed specifically excludes such interactions from having any practical 
use for library screening. The present invention has the advantage over the methods 
of WO 98/37186 that the encoded protein may be separated from the DNA which 
encoded it without damaging the DNA. 

Non-covalent binding refers to an association that may be disrupted by 
30 methods well known to those skilled in the art, such as the addition of an appropriate 
solvent, or a change in ionic conditions, for example, the addition of low pH glycine 
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or high pH triethylamine. In the present case, a typical example of non-covalent 
binding would be the non-covalent interaction between a DNA binding protein and a 
DNA molecule. Conversely, when a covalent linkage is formed between the DNA 
and the encoded polypeptide, the displayed peptide or protein will not be released 
5 from the DNA by ionic conditions and solvents that would disrupt non-covalent 
DNA binding protein:DNA interactions. For example, the bacterial replication 
protein repA binds non-covalently to its target DNA sequence oriR and can be 
released from this target DNA sequence at salt concentrations greater than 0.2M KC1 
(Giraldo R. & Diaz R. 1992 J. Mol. Biol. 228: 787-802). This salt concentration 
1 0 would not affect a covalent linkage, which would require much harsher conditions to 
release the covalently bound protein, with the increased risk of damage to the 
recovered DNA. 

The current invention describes cis activity and non-covalent binding which 
allow the encoded peptide or protein to remain associated with the DNA construct 
1 5 with a half life sufficient to allow individual peptides and the associated DNA 

encoding that peptide with an activity of interest to be separated from the resulting 
mixture of protein DNA complexes. For example, the association between the 
encoded protein and its DNA may have a half life of up to 30 minutes, up to 45 
minutes, up* to one hour, up to 2 hours, up to 6 hours or up to 12 hours. The 
20 screening methods of the invention may therefore be carried out immediately after 
construction of the library, or later, for example up to one, up to two, up to six, up to 
twelve hours or up to twenty four hours later. 

Surprisingly, therefore, the invention described herein demonstrates that such 
encoded peptides or proteins can be expressed in vitro and bound to the DNA 
25 encoding that peptide in the presence of other DNA sequences. The invention also 
* demonstrates that covalent linkage between protein and DNA is not required to 

maintain such cis activity, and that a non-covalent interaction between DNA and 
. binding protein is sufficient to allow selection of peptides in an in vitro expression 
and selection system. Additionally, the invention demonstrates that a peptide of 
30 interest can be selected from a mixture of non-binding peptides, and the DNA 
encoding that peptide may be recovered. 
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According to the present invention, individual DNA library members, each of 
which encodes a peptide to be expressed in the peptide expression library (library 

-~TneTfib?rTeptide)rare^laced 

which the DNA library member is placed includes all the sequences necessary to 
5 allow expression of the library member peptide from the construct and to allow the 
peptide encoded by the construct to bind to the DNA construct which encoded it. 
Each peptide in the library will typically comprise a fusion protein comprising the 
library member peptide fused to a peptide involved in binding of the fusion protein to 
the relevant DNA construct. Such fusion proteins may comprise further sequences 
10 and said library peptide may be joined to said binding peptide via a linker sequence. 
A plurality of such constructs, encoding a plurality of different library 
member peptides form a DNA library of the invention. Expressing such a library of 
DNA molecules results in the non-covalent binding of individual encoded proteins to 
the DNA which encoded them and from which they have been transcribed and 
15 translated, in the presence of many other DNA molecules that encode other members 
of the library: The sequence encoding the peptide library member present in a 
particular encoded protein will therefore be present in the DNA which is bound to 
that protein. This process therefore links the library member peptide, in a . 
biologically active form (usually having a binding activity) to the specific library 
20 member DNA sequence encoding that peptide, allowing selection of peptides of 
interest, for example due to a particular binding activity, and subsequent isolation 
and identification of the DNA encoding that library member peptide. 

For the purposes of the invention a DNA library is therefore a population of 
DNA constructs. Each construct comprises a DNA sequence encoding a peptide to 
25 be expressed as a library member peptide and each contains appropriate promoter, 
translation start and stop signals. A DNA library of the invention will contain a 
plurality of such DNA molecules. A plurality of DNA constructs are provided each 
encoding a library member peptide to provide a plurality of different library 
members. Preferably a DNA library will contain at least 10 4 discrete DNA 
30 molecules. For example, a DNA library may contain more man 10 6 , more than 10 8 , 
more man 10 10 - more than 10 12 or more than 10 14 discrete DNA molecules.. 



A peptide expression library is defined as a population of peptide sequences 
expressed from a library of DNA molecules. A peptide expression library of the 
present invention therefore encompasses a library of peptides which are non- 
covalently bound to the DNA which encoded them. For example, a peptide 
5 expression library of the present invention may be a library of at least 10 4 discrete 
proteins each comprising a library peptide sequence, expressed from a library of 
DNA molecules. A peptide expression library of the invention may be any library 
formed by the expression of a DNA library of the present invention. 

A peptide library member can be defined as an amino acid chain of random 
1 0 composition of at least two amino acids in length, or part or all of a naturally 

occurring protein such as an enzyme, a binding molecule such as an antibody or a 
fragment thereof. 

A DNA construct according to the present invention may comprise DNA 
encoding a library member peptide and means for the encoded peptide to bind to the 
1 5 encoding DNA construct. In addition to DNA encoding a library member peptide, a 
suitable DNA construct of the invention comprises at least a DNA target sequence 
and DNA encoding a peptide capable of binding directly or indirectly to the DNA 
target sequence. 

According to the present invention, the DNA construct and the encoded 
20 protein are selected to have cis-activity. That is, the encoded protein has the ability 
to bind specifically to the DNA molecule which encoded it. For example, cis-activity 
may function to allow the encoded DNA binding peptide to bind specifically 
(directly or indirectly) to the target sequence of the DNA construct which encoded it 
rather than to the target sequence of another DNA construct. 
25 In some cases, cis activity may be provided by a cis-acting DNA element. In 

other cases, a separate cis acting DNA element per se may not be required where the 
nature of the encoding DNA inherently confers cis activity on the encoded peptide. 

A cis-acting DNA element may be provided in the DNA construct together 
with the DNA encoding a peptide that interacts with that cis element. For example, 
30 in the case of the cis element from the rep A system discussed below, DNA eroding 
a fragment of the repA sequence comprising at least 20 amino acids from the C 
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terminal of repA may be provided along with the cis DNA element. It may be 
possible to confer cis activity upon a DNA binding peptide that is not normally cis- 

act ing by incl u ding in a gDNXTBClBttqctTO 

sequences necessary for its action. For example, DNA encoding a peptide that 
5 interacts with the cis element used may be included in the DNA construct. 

Alternatively, a peptide that interacts with the cis element may be part of the 
DNA binding peptide. For example, the DNA binding peptide may be repA which 
comprises the sequence that interacts with the repA cis element. Alternatively, the 
DNA binding peptide may bind to its encoding DNA in cis without the need for a 

10 separate cis element. 

A suitable cis-acting DNA element may be any element which allows cis- 
activity. Such a cis-acting DNA element may act, for example, by interacting with 
the machinery involved in translation and transcription of the DNA construct to delay 
the production and release of the encoded peptide. 

Any DNA element which allows the encoded peptide to bind specifically to 
the DNA molecule which encoded it may be used as a cis-acting DNA element 
according to the present invention. One example of a suitable cis-acting DNA 
element is that of the repA-cis system described in more detail below. In that system, 
RNA polymerase is paused by loops in the 5' cis sequence prior to the rho dependent 
20 termination of transcription. The action of the cis-acting DNA element therefore 
allows the encoded binding peptide to bind to the DNA target sequence in the 
construct from which it was produced. 

Preferably, the cis DNA element will be be located 3' in the DNA construct to 
the library member peptide and to the peptide or protein capable of binding to the 
25 DNA target sequence. This means that these sequences may be transcribed and 
translated before the RNA polymerase reaches the cis acting sequence. 

According to the present invention, the binding peptide may be linked to the 
DNA construct directly or indirectly. In the case of direct binding, the binding 
peptide binds directly and non-covalently to the DNA target sequence. In the case of 
30 indirect binding, the link between the binding peptide and DNA construct is provided 



15 
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by a further molecule. Such a molecule, for example a bifunctional agent as 
described below, will associate with both the peptide and the DNA target sequence. 

A suitable DNA construct may comprise further sequences, for example 
suitable promoter Sequences to allow expression of the encoded peptide . 
5 One example of a system in which cis-activity exists is the a cis acting 

incompatibility group plasmid replication protein, termed repA, system. Aspects of 
this system may be utilised in the present invention as explained below. 

Numerous plasmids include sequences encoding repA and cis DNA elements. 
The rep A sequence and cis DNA element present in a DNA construct of the 
1 0 invention may be derived from the same plasmid strain or may be derived from 
different plasmid strains. 

It is believed that the repA-cis system acts as shown in Figure 1 . Briefly, 
. RNA polymerase is paused by loops in the 5'-CIS sequence prior to rho dependent 
termination of transcription. This allows transient C-terminal repA peptide 
1 5 interaction with CIS, and possibly DNA bending. RepA peptide then binds to ori, 
which is a defined distance away from the terminal amino acid of the repA coding 
sequence (Prazkier et al. 2000 J. Bacteriology 182: 3972-3980; Praszkier and Pittard 
1999 J. Bacterid. 181: 2765-2772; Masai and Arai. 1988 Nucleic Acids Res. 16: 
6493-6514). 

20 The compatibility of a RepA sequence from a plasmid with a cis sequence 

from another plasmid can be readily determined by monitoring for the interaction of 
RepA with the selected cis sequence. 

Suitable repA proteins and sequences and cis DNA elements include those of 
the IncI complex plasmids or the IncF, IncB, IncK, IncZ and IncL/M plasmids, which 

25 are distantly related at the DNA level, but which control plasmid replication through 
the action of the cis acting repA protein (Nikoletti et al. 1986 J. Bacteriol. 170:131 1- 
1318; Prazkier J. et al. 1991 J. Bacteriol. 173: 2393-2397). Specific plasmids which 
may be used to provide these sequences include the Rl plasmid of the IncII 
incompatibility group and the incB plasmid pMU720 (described by Praskier J. & 

30 Pittard J. 1 999 Role of CIS in replication of an IncB plasmid. J. Bacteriol. 181: 2765- 
2772). The DNA and amino acid sequences of repA derived from the Rl plasmid of 
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IncH are given in SEQ ID Nos: 15 and 16. The DNA sequence of the cis DNA 
element from the Rl plasmid of IncH is given in SEQ ED NO: 17. Typically, the cis 

e lement i sT30io20<rHIBte^ : " ~ 

vised, so long as the sequence maintains the ability to interact with RepA and display 
5 cis activity . Minor variations, such as substitutions or deletions within the cis 

sequence are also contemplated such as modifications at 1, 5, 10 up to 20 nucleotides 
within the wildtype cis sequence. 

The cis element is required for cis activity of the rep A protein (Praszkier and 
Pittard 1999 J. Bacterid. 181: 2765-2772). The cis DNA element should therefore 
10 also be located 3' in the DNA construct to the DNA encoding the repA sequence. On 
reaching the cis sequence, the RNA polymerase will be paused, allowing the encoded 
protein to bind the DNA target sequence. 

In one embodiment of the present invention, the DNA binding protein itself 
comprises RepA or a fragment or variant thereof capable of DNA binding, including 
1 5 at least the 20 C-terminal amino acids of RepA capable of binding to the cis DNA 
element. In this embodiment, the DNA target sequence comprises an ori sequence, 
for example the oriR sequence. In alternative aspects of the present invention, the 
DNA binding protein is provided by an alternative protein with the relevant DNA 
target sequences recognised by such binding protein incorporated into the sequence. 
20 In each of these embodiments, DNA-protein binding is direct in that the peptide 

encoded by the DNA construct will bind directly to the encoding DNA construct. In 
alternative aspects of the invention, as described in more detail below, the DNA- 
protein binding may be indirect through the use of a peptide tag-DNA tag, 
bifunctional agent and/or suitable linkers. 
25 In one aspect, the same sequence may therefore provide both the peptide 

capable of binding the DNA target sequence and the C terminal ammo acids of repA. 
Such a sequence may be or may comprise a complete repA sequence, or a fragment 
or variant thereof of a rep A sequence which retains the ability to bind to the DNA 
target sequence used. Where the repA acts as a DNA binding protein, both cis and 
30 ori sequences (Praszkier and Pittard 1999 J. Bacterid. 181 : 2765-2772) are required 
for cis activity (cis) and DNA binding (ori). In this aspect, therefore, the DNA target 
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sequence is an ori sequence and the peptide or protein capable of binding said target 
is a repA protein. The position of ori in the DNA constructs of the invention may be 
varied. As described earlier, suitable repA, cis and ori sequences may be provided by 
one or more plasmids. For example, suitable sequences may be provided from the 

5 IncI complex plasmids or the IncF, IncB, IncK, IncZ and IncL/M plasmids. The 
DNA sequence of the ori from the Rl plasmid of IncII is given in SEQ ID NO: 18. 
This sequence, or a fragment thereof may be included in a DNA construct of the 
invention. A DNA construct of the invention may include a complete ori sequence or 
may include a fragment thereof which is capable of being bound by the rep A protein 

10 being used. 

The RepA protein used in accordance with the present invention may also 
comprise a fragment or variant of RepA, so long as such variant or fragment of RepA 
maintains the ability to bind to the selected ori sequence. Such variant or fragment of 
RepA may include substitutions, for example, at 1, 2, 3 up to 20 amino acids within 

15 the RepA sequence so long as such variants maintain the ability to bind to the ori 
sequence. A suitable fragment of RepA is an ori binding sequence of RepA. Ori 
sequences include those which are present in wild type plasmids as described above. 
Typically, such an ori sequence is 170 to 220 nucleotides in length. Fragments and 
variants of wild type ori sequences may also be used, so long as such ori sequences 

20 maintain the ability- to be recognised by RepA. Suitable ori sequences for use in 
combination with selected RepA proteins can readily be determined by monitoring 
for the interaction of RepA with such an ori sequence. 

The basic principle of the invention may therefore be described with reference 
to the repA/cis/ori system, as shown in Figure 1 . This shows an example of a DNA 

25 construct of the invention. This construct comprises, from 5' to 3\ a promoter 
sequence, a sequence encoding a library member peptide, a sequence encoding a 
repA protein, a cis DNA element ajid an ori sequence. Briefly, the DNA sequence is 
transcribed from the promoter by RNA polymerase to RNA. The rho dependent 
termination function present in the cis DNA element causes the RNA polymerase to 

30 pause at this part of the sequence. This allows the repA protein and the library 
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peptide to be translated. The repA protein is then able to bind to the ori sequence, 
linking the encoded protein to the encoding DNA construct. 

- in one preferred^t^o^dmrentriibrary member-BNA-sequeaGe(-s)-are-fased-to- 



the repA, cis and ori DNA of the IncFH plasmid Rl (Masai H et al. 1983 Proc Natl. 
Acad Sci USA 80: 6814-6818). In this embodiment, the library member DNA 
sequenced) of interest may be joined by a region of DNA encoding a flexible amino 
acid linker, to the 5'-end of the repA DNA, under the control of an appropriate 
promoter and translation sequences for in vitro transcription/translation. Many 
suitable promoters are known to those skilled in the art, such as the araB, tac 
promoter or the T7, T3 or SP6 promoters, amongst others. The promoter should be 
upstream of the polypeptide sequence to be expressed. 

The repA family of proteins is used herein by way of example, not limitation. 
Other unrelated non-covalently binding cis acting DNA binding proteins could be 

used in this invention. 
5 In a further embodiment, non-cis acting DNA binding proteins may be 

converted to having cis-activity (see Figure 2). This may be achieved by using such 
proteins, capable of binding the DNA target sequence, either directly or indirectly, in 
combination with sequences which can confer cis-activity upon them. Cis activity 
may be conferred on a binding protein that does not normally act in cis by including 
0 in the DNA construct a cis-acting DNA element such as the cis element of the repA 
system. Such an element may be included to ensure that the DNA binding by the 
DNA binding protein is cis, that is, an encoded DNA binding protein will bind to the 
DNA construct from which it has been transcribed and translated. 

In one embodiment, a suitable DNA construct may therefore comprise the cis- 
25 acting DNA element from the repA system. Such an element may further comprise 
DNA encoding a portion of the C-terminal end of RepA, preferably at least 20 amino 
acids, more preferably 30 amino acids, up to 40, 50, 60 or 70 amino acids from the 
C-terminal portion of repA, wherein said fragment of repA is capable of interacting 
with the cis-acting DNA element within the construct. DNA encoding sequences of 
30 the present invention may comprise wild type sequences encoding the desired 

fragment of RepA, degenerate sequences encoding fragments of wild type RepA or 
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sequences encoding variants of such fragments of RepA which maintain the ability to 
interact with the cis element incorporated into the DNA construct. Such variants 
may include substitution of 1, 2, 3 or 4 amino acids within the 20 amino acid C- 
terminal of RepA. 

5 The rep A family of proteins is used herein by way of example, not limitation. 

Any cis-acting DNA element capable of conferring cis-activity on a non-cis acting 
protein could be used . 

Any non-cis acting protein may be converted in this way. By way of 
example, not exclusion, the estrogen receptor DNA binding domain (DBD) can be 

10 converted into a cis acting DNA binding protein. The oestrogen receptor DNA 

binding domain fragment (amino acids 176-282) has been expressed in E. coli and 
shown to bind to the specific double stranded DNA oestrogen receptor target HRE 
nucleotide sequence, with a similar affinity (0.5nM) to the parent molecule (Murdoch 
et al. 1990, Biochemistry 29: 8377-8385; Mader et al., 1993, DNAs Research 21: 

15 1 125-1 132). In one embodiment, the DNA encoding this sequence is fused, 

preferably at the 3 5 -end, to the DNA encoding at least the last 20 amino acids of 
repA, the cis DNA element, and the DNA up to the ori sequence followed by the 
estrogen receptor target recognition sequence (5'-TCAGG TCAGA GTGAC 
CTGAG CTAAA ATAAC ACATT CAG-3', SEQ ID NO: 14) which replaces the 

20 repA ori recognition sequence. The DNA sequence(s) of interest may then be joined 
by a region of DNA encoding a flexible amino acid linker, to the 5 ' -end of to the 
estrogen receptor DNA fragment, under the control of an appropriate promoter and 
translation sequences for in vitro transcription/translation. Expression of this . 
. polypeptide directs the estrogen receptor DBD to its target sequence, present in place 

25 of the normal ori sequence, on the DNA encoding that polypeptide. Protein-DNA 

complexes can then be isolated by capture on a target protein. Unbound protein-DNA 
complexes can be washed away, allowing enrichment for DNA encoding peptides or 
proteins of interest, which can then be recovered by PCR, and enriched further by 
performing several further cycles of in vitro expression and protein-DNA complex 

30 capture using methods described previously . 
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It will be clear that this approach will apply to other DNA binding proteins 
simply by using the cis DNA element and a sequence encoding at least the C- 

-- -tenninatafomiiflo-aeids-of^ cis-actin g 

system in the DNA constructs. 
; In another embodiment, libraries of randomized DNA binding proteins, such 

as zinc finger proteins, helix-loop-helix proteins or heUx-turn-helix proteins by way 
of example, may be screened for specific binding to atarget sequence of interest (see 
Figure 3). In this embodiment, the ori recognition sequence of repA may be replaced 
by atarget sequence of interest, and the majority of the repA coding sequence by a 
0 library of randomised zinc finger proteins. The DNA binding proteins therefore act as 
both the library member peptides and the proteins capable of binding the DNA target 
sequence in this aspect, the DNA encoding each zinc finger protein, may 
additionally be joined, at the 5* -end, to a peptide tag sequence which can be 
recognized by an another capture protein such as an antibody, and at the 3'-end, to 
15 the DNA encoding at least the last 20 amino acids of repA, the cis DNA element, and 
the DNA up to the ori sequence followed by the target sequence of interest. 
Expression of this polypeptide directs the zinc finger protein to the target sequence of 
interest, present in place of the normal ori sequence, on the DNA encoding that 
polypeptide. Binding to the target sequence will only occur if the randomised zinc 
20 finger domain is capable of binding to the sequence of interest. Protein-DNA 

complexes can then be isolated by capture with a binding protein which recognizes 
the peptide tag at the N-terminus of the. fusion protein polypeptide. Unbound DNA 
can be washed away, allowing enrichment for DNA encoding zinc finger proteins 
capable of binding the target sequence, which can then be recovered by PCR, and 
25 enriched further by performing several further cycles of in vitro expression and 
protein-DNA complex capture. 

As explained above, the binding peptide may bind directly to the DNA target 
sequence, for example in the case of a DNA binding protein-target sequence pair, or 
it may bind indirectly to the DNA target sequence, for example via a bifunctional 
30 agent and optionally a DNA tag (see Figure 4): 
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In one embodiment, DNA encoding a peptide tag which is not able to bind 
directly to the DNA target sequence is joined to the 5'-end of library member DNA 
sequenced) of interest, optionally by a region of DNA encoding a flexible amino 
acid linker, under the control of an appropriate promoter and translation sequences 
5 for in vitro transcription/translation. This forms the DNA encoding the binding 
peptide, as the encoded peptide is linked indirectly to the DNA target sequence. 
Optionally at the 3'-end of the library member DNA sequence is the DNA encoding 
at least the last 20 amino acids of repA and the cis DNA element, but not the ori 
target sequence of repA. The DNA target sequence may be or may comprise a DNA 

1 0 tag. Such a DNA tag may be a single modified base. For example, when preparing 
. the library DNA construct containing the elements described, the DNA may be 
tagged at the 3'-end with, by way of example not limitation, molecules such as 
fluorescein or biotin. 

Prior to in vitro expression, the library DNA fragments may be mixed with a 

15 bifunctional agent, one function of which is to recognize and bind to the target 

sequence which may be at the 5' end of the DNA, in a ratio of one DNA fragment: 
one bifunctional molecule. The other functional element of this bifunctional agent is 
a binding agent that can recognize and bind to the peptide tag which may be encoded 
at the 5 '-end of the DNA fragment. By way of example not exclusion, the 

20 bifunctional agent can be composed of an Fab fragment recognizing the fluorescein 
or biotin tag on the DNA, and another Fab fragment recognizing the peptide tag 
encoded in the DNA. It is clear to those skilled in the art that this bifunctional agent 
can be made by many different methods such as chemically cross-linking the two 
elements, or by expressing the two elements as a fusion protein, or as a bi-specific 

25 antibody. Said methods of creating a bifunctional agent are given by way of example 
not exclusion. - 

The bifunctional agent may be bound to the DNA construct prior to 
expression of the encoded peptide or may be provided during expression. 

The fusion protein is then-transcribed and translated from the DNA construct 

30 while bound to the bifunctional agent. The peptide tag is translated first, and can be 
bound by the second element of the bifunctional agent, prior to release of messenger 
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RNA or RNA polymerase from the DNA. This creates a functional protein-DNA 
complex where both expressed polypeptide and DNA encoding that peptide are 
Finked through ihe^tun~clonaTage^^ tag molecule is therefbre linked ~ 

indirectly, but specifically, to the DNA target (tag). By linking the protein to the 
5 DNA construct in this way, it is possible to screen for a protein having particular 
properties, as described below, and then to identify the encoding DNA which is. 
linked to that protein. By using a bifunctional agent rather than covalent binding 
between the protein and DNA the DNA construct may be more easily separated form 
the protein without the risk of damaging the DNA. 
1 o Protein-DNA complexes can then be isolated by capture of a target protein. 

Unbound protein-DNA complexes can be washed away, allowing enrichment for 
DNA encoding peptides or proteins of interest, which can then be recovered by PCR, 
and enriched further by performing several further cycles of in vitro expression and 
protein-DNA complex capture using methods described previously. 
1 5 Additionally, under this embodiment, the DNA can be bound directly, for 

example by covalent binding, to a bifunctional agent such as. a polymer. Such a 
polymer can contain more than one binding element that could recognise the peptide 
tag, allowing multivalent display of a peptide expression library molecule in a unit 
wmtairiing the DNA encoding the displayed peptide. By way of example, not 
20 limitation, said polymers can be composed of polyethylene as well as other 

polymeric compounds, capable of being fused to DNA. The DNA construct of the 
invention may therefore be provided bound to such a bifunctional agent, or bound to 
a DNA tag as decsribed above which is capable of being bound by such a 
bifunctional agent. 

25 In all embodiments of the invention, the DNA constructs include appropriate 

promoter and translation sequences for in vitro transcription/translation. Any suitable 
. promoter can be used, such as the ara B, tac promoter, T7, T3 or SP6 promoters 
. amongst others. The promoter is placed so that it is operably linked to the DNA 
sequences of the invention such that such sequences are expressed. 
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The DNA encoding the library member peptides may be produced by any 
sourcible means. In particular, such DNA may comprise DNA isolated from cDNA, 
obtained by DNA shuffling, and synthetic DNA. 

The DNA construct may also encode amino acid linkers within the expressed 
5 * fusion protein. In particular, a flexible amino acid linker may be included to join the 
DNA binding peptide/RepA to/the library member peptide. 

According to the invention, with reference to this preferred embodiment, 
peptide or protein expression libraries, linked to the DNA encoding them, can be 
generated and peptides with the desired activity selected by the following steps: 

10 Constructing a library of fusion proteins, 

A DNA library of peptides or proteins may be fused to DNA encoding a 
peptide capable of binding to the DNA target sequence, such as a cis acting DNA 
binding protein DNA, by a region of DNA encoding a flexible amino acid linker, 
under the control of an appropriate promoter and with a translation, or ribosome 

15 binding site, start and stop codons, in a manner suitable for in vitro expression of the 
peptide library members and binding proteins. In the example of the repA protein, 
the DNA (such as DNA) library members are fused to the repA DNA binding protein 
DNA, or a fragment thereof. The cis and ori sequences may be included in the 
construct downstream of the other elements. In the case of a DNA, library, said DNA 

20 constructs are designed to be suitable for in vitro transcription and translation. 
Expression and cis binding of DNA library fusion proteins. 

In order to allow cis activity, a coupled bacterial transcription/translation 
environment such as the S3 0 extract system (Zubay, G. 1973. Ann. Rev. Genet. 7; 
267) may be used. Expression of the peptide, such as the DNA library member 

25 peptide-repA fusion protein, in this environment, will result in binding of the fusion 
protein to the DNA encoding that fusion protein, provided that both cis and ori 
sequences are present. When libraries of peptide-repA fusion proteins are expressed 
in this manner, this process results in the production of libraries of protein-DNA 
complexes where the protein attached to the DNA is encoded by that fragment of 

30' DNA from which it was expressed, thereby allowing subsequent selection of both 
peptides or protein of interest, and the DNA encoding said peptides. The complexity 
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of these libraries is enhanced by the in vitro nature of the method, libraries of at least 
10 lo -10 14 DNA fragments, if not even larger libraries, can easily be generated. 

-gQlgctionxtfthe-peptide-of- interest- ■■ 

An in vitro peptide expression library produced by a method of the present 
5 . invention may be used to screen for particular members of the library. For example, 
the library may be screened for peptides with a particular, activity or a particular 
binding affinity. Protein-DNA complexes of interest may be selected from a library 
by, for example, affinity or activity enrichment techniques. This can be accomplished 
by means of a ligand specific for the protein of interest, such as an antigen if the 
0 protein of interest is an antibody. The ligand may be presented on a solid surface 
such as the surface of an ELISA plate well, or in solution, for example, with 
biotinylated ligand followed by capture on to a streptavidin coated surface or 
magnetic beads, after a library of protein-DNA complexes had been incubated with 
the ligand to allow ligand-ligand interaction. Following either solid phase or in 
1 5 solution incubation, unbound complexes are removed by washing, and bound 

complexes isolated by disrupting ligand-ligand interactions by altering pH in the 
well, or by other methods known to those skilled in the art such as protease digestion, 
or by releasing the DNA directly from the complexes by phenol-chloroform 
extraction to denature the repA-ori DNA binding. Recovering bound complexes, 
20 reamplifying the bound DNA, and repeating the selection procedure provides an 

enrichment of clones encoding the desired sequences, which may then be isolated for 
sequencing, further cloning and/or expression. For example, the DNA encoding the 
peptide of interest may be isolated and amplified by, for example PCR In one 
embodiment, repeated rounds of selection and DNA' recovery may be facilitated by 
25 the use of sequential nesting of PCR primers. DNA ends are generally damaged after 
multiple PCR steps. To recover DNA from such damaged molecules required the 
primers to be annealed away from the ends of the DNA construct, thereby 
sequentially shortening the construct with every round of selection. 

In one aspect, the DNA construct and/or the encoded protein may be 
30 configured to include a tag. Such a peptide or DNA tag, for example as described 
above, may be used in the separation and isolation of a library member of interest. 
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Such a tag may also be used to hold the library members, for example on a solid 
support for use in the screening methods described herein. 

It can therefore be seen that the screening methods of the present invention 
may include the further step of selecting and isolating the relevant library member 
5 peptide, allowing the peptide exhibiting the desired properties, and also the DNA 
encoding that peptide, to be identified and purified.. 

Numerous types of libraries of peptides fused to the cis acting DNA-binding 
protein can be screened under this embodiment including: 

(i) Random peptide sequences encoded by synthetic DNA of variable length. 
10 (ii) Antibodies or antibody fragments, for example single-chain Fv antibody 

fragments. These consist of the antibody heavy and light chain variable region 
domains joined by a flexible linker peptide to create a single-chain antigen binding 
molecule. 

(hi) Random cDNA fragments of naturally occurring proteins isolated from a 

1 5 cell population containing an activity of interest. 

(iv) Random peptide sequences inserted into, or replacing a region of a known 
protein, whereby the known protein sequence acts as a scaffold, which constrains the 
random peptide sequence. Many such scaffolds have been described, by way of 
example, not exclusion, CTLA-4 (WO 00/60070), has been used as a scaffold for 

20 peptide libraries. 

In another embodiment the invention concerns methods for screening a DNA 
library whose members require more than one chain for activity, as required by, for 
example, antibody Fab fragments for ligand binding. In this embodiment heavy or 
light chain antibody DNA is joined to a nucleotide sequence encoding a DNA 

25 binding domain of, for example, repA. Typically the unknown antibody DNA library 
sequences for either the heavy (VH and CHI) or light chain (VL and CL) genes are 
inserted in the 5' region of the repA DNA, behind an appropriate promoter and 
translation sequences. Thus, repA fused to a DNA library member-encoded protein is 
produced bound to the DNA encoding that protein. The second known chain, 

30 encoding either light or heavy chain protein, is expressed separately either: 



22 

(a) from the same DNA fragment containing the repA and the first 
polypeptide fusion protein library, or 

(b) frnm « separate frag ment of DNA present in the in v itro '_ 

transcription/translation reaction. 
5 The known chain associates with the library of unknown fusion proteins that 

are fused to the repA protein and thereby bound to the DNA for the unknown chain. . 
The functional Fab library can then be selected by means of a ligand specific for the 
antibody. 

In order that the invention is more fully understood, embodiments will now 
10 be described in more detail by way of example only and not by way of limitation 
with reference to the figures below. 

Examples of some of the embodiments of the invention are given below: 

15 Materials and Methods 

The following procedures used by the present applicant are described in 
Sambrook, J., et al., 1989 supra.: analysis of restriction enzyme digestion products on 
agarose gels, DNA purification using phenol/chloroform stock solutions, preparation 
of phosphate buffered saline. 
20 General purpose reagents were purchased from SIGMA-Aldrich Ltd (Poole, 

Dorset, U.K.). Oligonucleotides were obtained from SIGMA-Genosys Ltd 
(Cambridgeshire, U.K.). Amino acids, and S30 extracts were obtained from Promega 
Ltd (Southampton, Hampshire, U.K.). Deep Vent and Taq DNA polymerases were 
obtained from New England Biolabs (Cambridgeshire, U.K.). Taqplus DNA 
25 polymerase was obtained from Stratagene Inc. (Amsterdam, Netherlands). 
GeneClean DNA gel purification kits were obtained from BIO101 (La Jolla, 
California, U.S.A.), anti-human Igx antibodies from Immunologicals Direct Ltd 
(Oxfordshire, U.K.), anti-c-myc polyclonal from Vector Labs Inc (Cambridgeshire 
U.K.), and anti-V5 antibody from Abeam Ltd (Cambridgeshire U.K.). Superblock 
30 blocking agent was obtained from Perbio Science (Cheshire, U.K.). 
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Example L Isolation of specific cis acting protein-DNA complexes 

The in vitro expression constructs were prepared by sequentially adding the 
TAC promoter, the c-myc epitope, either the human kappa constant region or the V5 
epitope to the RepA-CIS-ORI region, by PCR amplification. Such constructs can be 
5 prepared by many methods known to one skilled in the art, for example, by 

amplifiying different fragments of DNA followed by assembly PCR. In this example, 
. the initial amplification template was the Rl plasmid which contains the RepA-CIS- 
ORI region (Masai, H. and Arai, K.(1988). DNAs Res. 16, 6493-6514). 

(a) Primary amplification. The RepA-CIS-ORI region was PCR amplified 
10 from a single colony of the strain ECO K12 harbouring plasmidtRl using 12.5pmol • 

of each of the primers REPAFOR (SEQ ID 01) and ORIREV (SEQ ID 02) in a 50jil 
reaction containing 0.25mM dNTPs, 2.5 units Taqplus Precision DNA polymerase, 
lx PCR reaction buffer (Stratagene Inc, Amsterdam, Netherlands). The REPAFOR 
primer anneals to the 5' -end of the RepA coding region. The ORIREV primer 

1 5 anneals to the 3 '-end of the non-coding ORI region. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 45 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 

20 the gel into 40|il sterile water using a Geneclean II kit accordingto the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(b) Secondary amplification. One |j.l (500 pg) of 100 times diluted gel- 
purified primary reaction product was re-amplified using 12.5pmol of each of the 
primers CKREPFOR (SEQ ID 03) and ORIREV (SEQ ID 02) in a 50jxl reaction 

25 containing 0.25mM dNTPs, 2.5 units Taqplus Precision DNA polymerase, and lx 
PCR reaction buffer (Stratagene Inc, Amsterdam, Netherlands). The CKREPFOR 
primer anneals to the 5* -end of the primary reaction product and appends the 3' part 
of the kappa constant region DNA. The ORIREV primer anneals to the 3' -end of the 
primary reaction product. 

30 PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 

minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
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seconds; 12° C, 2 minutes, followed by a single cycle 1 0 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40pl sterile water using a Geneclean 11 kit according toThe manufacturers " 
instructions (BiolOl, La Jolla, California, U.S.A.). 
5 (c) Third amplification. One p.1 (500 pg) of 100 times diluted gel-purified 

primary reaction product was re-amplified using 12.5pmol of each of the primers 
V5REPFOR (SEQ ID 04) and ORIREV (SEQ ID 02) in a 50ul reaction containing 
0.25mM dNTPs, 2.5 units Taqplus Precision DNA polymerase, and lx PCR reaction 
buffer (Stratagene Inc, Amsterdam, Netherlands). The V5REPFOR primer anneals to 
10 the 5'-end of the primary reaction product and appends the 3' part of the V5 epitope 
DNA. The ORIREV primer anneals to the 3'-end of the primary reaction product. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 2 minutes, followed by a single cycle 10 minutes at 72°C. Reaction 
1 5 products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40ul sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(d) Fourth amplification. One ul (500 pg) of 100 times diluted pCKV5 
plasmid using 12.5pmol of each of the primers MYCCKFOR (SEQ ID 05). and 
20 CKREV (SEQ ID 06) in a 50ul reaction containing 0.25mM dNTPs, 2.5 units 

Taqplus Precision DNA polymerase, and lx PCR reaction buffer (Stratagene Inc, 
Amsterdam, Netherlands). The pCKV5 plasmid contains the human kappa constant 
region cDNA (McGregor DP, Molloy PE, Cunningham C, & Harris WJ. 1994 Mol. 
Immunol. 3 1 : 219-26) and the V5 epitope DNA (Southern JA, Young DF, Heaney F; 
25 Baumgartner WK, Randall RE. 1991 J. Gen. Virol. 72: 1551-7). The MYCCKFOR 
primer anneals to the 5'-end of the kappa constant region DNA and appends the 3' 
part of the MYC epitope DNA. The CKREV primer. anneals to the 3'-end of the 
kappa constant region DNA. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
30 minutes and 15 seconds at 94*C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
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seconds; 72°C, 2 minutes, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40pl sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 
5 (e) Fifth amplification. One ^1 (500 pg) of 100 times diluted pCKV5 plasmid 

using 12.5pmol of each of the primers MYCV5FOR (SEQ ID 07) and V5REV (SEQ 
ID 08) in a 50\xl reaction containing 0.25mM dNTPs, 2.5 units Taqplus Precision 
DNA polymerase, and lx PCR reaction buffer (Stratagene Inc, Amsterdam, 
Netherlands), The MYCV5FOR primer anneals to the 5' -end of the V5 epitope DNA 
1 0 and appends the 3 ' part of the MYC epitope DNA. The V5REV primer anneals to the 
3'-end of the V5 epitope DNA. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 30 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
1 5 products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40^1 sterile water using a Geneclean II kit according to the manufacturers 
instructions (Bio 1 0 1 , La Jolla, California, U.S.A.). 

(f) Sixth amplification. One pi (500 pg) of 100 times diluted pTACP2A 
plasmid (ref) using 12.5pmol of each of the primers TAC3 (SEQ ID 09) and 
20 MYCTACREV (SEQ ID 10) in a 50jxl reaction containing 0.25mM dNTPs, 2.5 units 
' Taqplus Precision DNA polymerase, and lx PCR reaction buffer (Stratagene Inc, 
Amsterdam, Netherlands). The TAC3 primer anneals to the 5' -end of the TAC 
promoter DNA. The MYCTACREV primer anneals to the 3'-end of the TAC 
promoter DNA and appends the 5' part of the MYC epitope DNA. 
25 PCR reactions were carried out on a Eppendorf Master Cycler for i cycle of 2 

minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 30 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40jxl sterile water using a Geneclean II kit according to the manufacturers 
30 instructions (BiolOl, La Jolla, California, U.S.A.). 




(g) First assembly PCR. One ul (50 ng) of each of the reaction products in (f) 
and (d) using 50 pmol of each of the primers TAC5 (SEQ ID 1 1) and CKKEV (SEQ 

IDni6) _ ih^50plleacU6^ 

polymerase mixture (20:1), and lx PCR reaction buffer (New England Biolabs, 
5 Beverly, MA, U.S .A.). The TAC5 primer anneals to the 5'-end of the reaction 

product (f) and adds 20 nucleotides. The CKREV primer anneals to the 3'-end of the 
reaction product (d). 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 9.4°C followed by 30 cycles of 94°C, 45 seconds; 60'C, 45 
10 seconds; 72°C, 45 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40ul sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S A.). 

(h) Second assembly PCR. One ul (50 ng) of each of the reaction products in 
15 (f) and (e) using 50 pmol of each of the primers TAC5 (SEQ ID 1 1) and V5REV 
(SEQ ID 08) in a 50ul reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent 
DNA polymerase mixture (20:1), and lx PCR reaction buffer (New England Biolabs, 
Beverly, MA, U.S.A.). The TAC5 primer anneals to the 5'-end of the reaction 
product (f) and adds 20 nucleotides. The V5REV primer anneals to the 3'-end of the 

20 reaction product (e) . 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72*C, 45 seconds, followed by a single cycle 10 minutes at 72'C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
25 the gel into 40ul sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(i) "Third assembly PCR. One ul (5 0 ng) of each of the reaction products in (b) 
and (g) or using 50 pmol of each of the primers TAC3 (SEQ ID 09) and ORIREV 
(SEQ ED 02) in a 50ul reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent 
30 DNA polymerase rmxture (20: 1), and lx PCR reaction buffer (New England Biolabs, 
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Beverly, MA, U.S. A.). The TAC3 primer anneals 20 nucleotides downstream to the 
5' -end of the reaction product (g). The ORIREV primer anneals to the 3 5 -end of the 
reaction product (b). The reaction product in (i) is called TAC-MYC-CK-REPA-CIS- 
ORI (SEQ ID 12). 

5 PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 

minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40 pi sterile water using a Geneclean.II kit according to the manufacturers 
10 instructions (BiolOl, La Jolla, California, U.S.A.). 

(j) Fourth assembly PCR. One pi (50 ng) of each of the reaction products in 
(b) and (h) or using 50 pmol of each of the primers TAC3 (SEQ ID 09) and 
ORIREV (SEQ ID 02) in a 50|il reaction containing 0.25mM dNTPs, 2.5 units 
TaqDeepVent DNA polymerase mixture (20:1), and lx PCR reaction buffer (New 
15 England Biolabs, Beverly, MA, U.S.A.). The TAC3 primer anneals 20 nucleotides 
downstream to the 5'-end of the reaction product (g). The ORIREV primer anneals to 
the 3'-end of the reaction product (b). The reaction product in (i) is called TAC- 
MYC-V5-REPA-CIS-ORI (SEQ ID 13). 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
20 minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40jj1 sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

25 

Preparation of in vitro transcription/translation reaction. The reaction was set 
up on ice, using a Promega bacterial linear template S30 coupled in vitro 
transcription/translation reaction kit as follows: 

20jlU TAC-MYC-CK-REPA-CIS-ORI template (O.Sjig of final construct DNA SEQ 
30 ID 012 above); 20pl TAC-MYC-V5-REPA-CIS-ORI template (O.Sfxg of final 




construct DNA SEQTD 013 above); 20pl complete amino acid mix (Promega); 80pl 
S30 Premix; 60(0,1 S30 mix; 

and the reaction was allowed to proceed at 25°C for 30 minutes and placed on ice, . 
then diluted 10 fold with blocking buffer (Superblock (Perbio Ltd), 0.1 % Tween 20, 

5 200|ag/ml herring sperm DNA). 

DNA-protein complex capture. NUNC star immunotubes were coated with 
10|-ig/ml of either anti-c-myc antibody, anti-V5 antibody, or anti-human kappa chain 
antibody, in 500\x\ PBS per tube overnight at 4°C. An additional tube was left blank 
as a negative control. Tubes were washed 2x PBS and blocked for 1 hour at room 

10 temperature with Superblock/PBS/O.lmg/ml herring sperm DNA/ 0.1% Tween 20 
and then washed 2x PBS. 500jj1 of diluted transcription/translation reaction was 
added to each tube and incubated at room temperature for 1 hour. Tubes were washed 
5x PBS/0.1% Tween 20, then lx 30 minutes with 2ml Superblock/PBS/O.lmg/ml 
herring sperm DNA/ 0.1% Tween 20, then 5x PBS. DNA was recovered with 300jj.1 

1 5 T.E. buffer plus 300jxl phenol/chloroform for 5 minutes with shaking. This was 

centrifuged at 13,200g for 5 minutes and DNA precipitated with 0.5 volume of 7.5M 
ammonium acetate, 20|ig glycogen and three volumes of absolute ethanol. Following 
centrifiigation, pellets were washed with 70% ethanol, vacuum dried and resuspended 
in 20\x\ water. 10pl of recovered DNA was reamplified in 50pl reactions with TAC3 

20 (SEQ ID 09) and ORIREV (SEQ ID 02) primers. Reaction products were 
electrophoresed on a 1% agarose/TAE gel (Figure 1). 

Example 2. Separating the RepA-DNA complex 

. The two in vitro expression constructs (SEQID12 and SEQID13) already 
25 described in example 1 were used in a selection experiment against anti-human C- 
kappa antibody as described in Example 1, except that DNA was recovered and 
released from RepA by using either of following methods; Glycine, Triethylamine, 
Phenol/Chloroform, Proteinase K, and EDTA. These methods are described below. 
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Glycine: tube was incubated with 500|il of 200mM Glycine, 150mM NaCl 
(pH2.0) for 10 minutes. The glycine eluate was then transferred to a fresh eppendorf 
tube and 50jal of 2M Tris (pH 8.5) added. 

Triethylamine: the tube was incubated 500|il of 0.1M Triethylamine for 10 
5 minutes and the triethylamine eluate was then transferred to a fresh eppendorf tube 
and 250jil of 1M Tris (pH 7.4) added. 

Phenol/Chloroform: as example 1 above. 

Proteinase K: the tube was incubated with 500^1 of lOOmM Tris (pH 8.0), 10 
mM EDTA (pH 8.0), 0.5% SDS for 30 minutes at 37°C. The Proteinase K eluate was 
10 then transferred to a fresh eppendorf tube. 

EDTA: the tube wasincubated with 250|xl of lOmM Tris (pH 8.0), 1 mM 
EDTA 500mM NaCl and 250|il of Phenol/Chloroform for 5 minutes. The EDTA 
eluate was then transferred to a fresh eppendorf tube. 

After recovery of DNA the DNA was Phenol/Chloroform extracted, where 
1 5 appropriate, followed by Ethanol precipitation as described in Example 1 . 1 Oul of 
resuspended DNA was reamplified in 50ul reactions with TAC3 (SEQID09) and 
CISREV (SEQID019) primers. The CISREV primer anneals 196 bases upstream of 
the binding site of ORIREV (SEQID02). Reaction products were electrophoresed'ori 
a 1% agarose/TAE gel (data not shown). Only the CIG-DNA containing construct 
20 (SEQID 12) was amplified, in approximately equivalent amounts. 

This not only tells us that any of the methods described above for recovering 
and releasing DNA from Rep A can be used, but this result also suggests that Rep A 
interacts in a non-covalent manner with its cognate DNA. 

25 Example 3. Detection of specific anti-V5 binders in a V5-spiking expeiment 
using CIS display technology. 

The in vitro expression constructs were prepared by adding the TAC 
promoter and either the V5 epitope or a- 12-mer NNB library to the RepA-CIS-ORI 
region, by PCR amplification. Such constructs can be prepared by many methods 
30 known to one skilled in the art, for example, by amplifiying different fragments of 
DNA followed by assembly PCR. In this example, the initial amplification template 




was the Rl plasmid which contains the RepA-CIS-ORI region (Masai, H. and Arai, 

K.(l 988). Nucleic Acids Res. 1 6, 6493-65 14). 

(a). Primaiy"Snplification. The RepA-CIS-ORI regio n" was P (^amplified 

from a single colony of the strain ECO K12 harbouring plasmid Rl using 12.5pmol 
5 of each of the primers REPAFOR (SEQ ID 01) and ORIREV408 (SEQ ID 20) in a 

50|il reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent DNA polymerase 
■ mixture (20:1), and Ix PCR reaction buffer (New England Biolabs, Beverly, MA, 

U.S.A.). The REPAFOR primer anneals to the 5'-end of the RepA coding region. 

The ORIREV408 primer anneals to the downstream of the 3 '-end of the non-coding 
10 ORI region. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 
' minutes and 30 seconds of 94°C followed by 25 cycles of 94°C, 30 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
15 the gel into 40|al sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(b). Secondary amplification. One |jd (500 pg) of 1 00 times diluted gel- 
purified primary reaction product was re-amplified using 12.5pmol of each of the 
primers V5(NNB)REPFOR (SEQ ID 21) and ORIREV408 (SEQ ID 20) in a 50|xl 
20 reaction containing-0.25mM dNTPs, 2.5 units TaqDeepVent DNA polymerase 

mixture (20: 1), and lx PCR reaction buffer (New England Biolabs, Beverly, MA, 
U.S.A.). The V5 (NNB)REPFOR primer anneals to the 5'-end of the primary reaction 
product and appends the V5 epitope DNA. The ORIREV408 primer anneals to the 
3 ' -end of the primary reaction product. 
25 PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 

minutes and 30 seconds of 94°C followed by 25 cycles of 94°C, 30 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction • 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40|uil sterile water using a Geneclean II kit according to the manufacturers 
30 instructions (BiolOl, La Jolla, California, U.S.A.). 
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(c). Third amplification. One pi (500 pg) of 100 times diluted gel-purified 
primary reaction product was re-amplified using 12.5pmol of each of the primers 
NNBREPFOR (SEQ ID 22) and ORIREV408 (SEQ ID 20) in a 50|il reaction 
containing 0.25mM dNTPs, 2.5 units TaqDeepVent DNA polymerase mixture 

5 (20: 1), and lx PCR reaction buffer (New England Biolabs, Beverly, MA, U.S.A.). . 
The NNBREPFOR primer anneals to the 5' -end of the primary reaction product and . 
appends a random amino acid 12-mer NNB library DNA. The ORIREV408 primer 
anneals to the 3' -end of the primary reaction product. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 

10 minutes and 30 seconds of 94°C followed by 25 cycles of 94°C, 30 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40jal sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

15 (d). Fourth amplification. One pi (500 pg) of 100 times diluted pTACP2A 

plasmid (ref) using 12.5pmol of each of the primers TACFARUP (SEQ ID 23) and 
TACREV (SEQ ID 27) in a 50jal reaction containing 0.25mM dNTPs, 2.5 units 
TaqDeepVent DNA polymerase mixture (20:1), and lx PCR reaction buffer (New 
England Biolabs, Beverly, MA, U.S.A.). The TACFARUP primer anneals to the 5'- 

20 end of the TAC promoter DNA. The TACREV primer anneals to the 3'-end of the 
TAC promoter DNA. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 1 
minutes and 45 seconds of 94 P C followed by 25 cycles of 94°C, 15 seconds; 60°C, 30 
seconds; 72°C, 30 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 

25 products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40pl sterile water using a Geneclean II kit according to the manufacturers 
instructions (Biol 01 , La Jolla, California, U.S.A.). 

(e). . First assembly PCR. One jxl (50 ng) of each of the reaction products in 
(b) and (d) using 50 pmbl of each of the primers TACFARUP (SEQ ID 23) and 

30 ORIREV408 (SEQ ID 20) in a 50|il reaction containing 0.25mM dNTPs, 2.5 units 
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TaqDeepVent DNA polymerase mixture (20:1), and lx PCR reaction buffer (New 
England Biolabs, Beverly, MA, U.S A.). The TACFARUP primer anneals to the 5'- 

end-of-me-reaction-pro^^ primer anneals to the 3'-end of the 

reaction product (b). The reaction product in (e).is called TAC-V5-REPA-CIS-ORI- 
408 (SEQ ID 24). 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 1 
minutes and 45 seconds of 94'C followed by 25 cycles of 94°C, 15 seconds; 60'C, 30 
seconds; 72'C, 1 minute and 30 seconds, followed by a single cycle 10 minutes at 
72°C. Reaction products were electrophoresed on an agarose gel, excised and 
) products purified from the gel into 40\d sterile water using a Geneclean II kit 

according to the manufacturers instructions (BiolOl, La Jolla, California, U.S A.). 

(f). Second assembly PCR One ul (50 ng) of each of the reaction 
products in (c) and (d) using 50 pmol of each of the primers TACFARUP (SEQ ID 
23) and ORIREV408 (SEQ ID 20) in a 50ul reaction containing 0.25mM dNTPs, 2.5 
5. units TaqDeepVent DNA polymerase mixture (20: 1), and lx PCR reaction buffer 
(New England Biolabs, Beverly, MA, USA.). The TACFARUP primer anneals to 
the 5'-end of the reaction product (d). The ORIREV480 primer anneals to the 3'-end 
of the reaction product (c). . 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 1 
JO minutes and 45 seconds of 94°C followed by 25 cycles of 94°C, 15 seconds; 60°C, 30 
seconds; 72°C, 1 minute and 30 seconds, followed by a single cycle 10 minutes at 
72° C. Reaction products were electrophoresed on an agarose gel, excised and 
products purified from the gel into 40ul sterile water using a Geneclean II kit 
according' to the manufacturers instructions (BiolOl, La Jolla, California, U.S.A.). 
25 The reaction product in (f) is called TAC-NNB-REPA-CIS-ORI-408 (SEQ ID 25). 

Preparation of in vitro transcription/translation reaction: The reaction set was 
set up on ice, using a Promega bacterial linear template S30 coupled in vitro 
transcription/translation reaction kit as follows: 

20ul of 5000 times diluted TAC-V5-REPA-CIS-ORI-408 template (O.lng of final 
30 construct DNASEQ ID 24 above) 
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20^1 of 5 TAC-NNB-REPA-CIS-ORI-408 template (0.5\ig of final construct 
DNASEQ ID 25 above) 
20\il complete amino acid mix (Promega) 
80|J.l S30 Premix 
5 60pl S30 mix 

and the reaction was allowed to proceed at 25 °C for 30 minutes and placed on 
ice, then diluted 10 fold with 2% Marvel/PBS. 

DNA-protein complex capture. NUNC star immunotubes were coated with 
lOng/mi of anti-V5 antibody in 500|il PBS overnight at 4°C. An additional tube was 
10 left blank as a negative control. Tubes were washed 2x PBS and blocked for 1 hour at 
room temperatue with blocking buffer (2% Marvel, 0.1% Tween 20, O.lmg/ml 
herring sperm DNA) and then washed 2x PBS. 1 ml of diluted 
transcription/translaiton reaction was added to each tube and incubated at room 
temperature for 1 hour. Tubes were washed 5x PBS/0.1% Tween 20 and then 5x 
1 5 PBS. DNA was recovered with 500pl TE buffer plus 500jil phenol/chloroform. This 
was centrifiiged at 13,200g for 5 minutes and DNA precipitated with 1/10 volume of 
3M sodium acetate, 50[xg/ml glycogen and two voulmes of absolute ethanol. 
Following centriflxgation, pellets were washed with 70% ethanol, vacuum dried and 
resuspended in 40(11 water. 20 pi of recovered DNA was reamplified in 50 |il 
20 reactions with the biotinylated primers bTAC6 (SEQ ID 26) and bCISREV (SEQ ID 
19). Reaction products were electrophoresed on a 1% agarose/TAE gel. 

Cloning of recovered DNA into the expression vector pDMG-K (SEQ ID 27). 
Reaction product were gelpurified and eluted with 50\xl sterile water using a 
QIAquick Gelextracation kit according to the manufacturers instructions (QIAGEN 
25 LtdWest Sussex, U.K.). Both the purified reaction product and the plasmid pDMG-K 
were digested with 20 units of Ncol and NotI (New England Biolabs, Beverly, MA, 
U.S.A.). Hie cut plasmid was gelpurified using a QIAquick Gelextracation kit 
according to the manufacturers instructions (QIAGEN LtdWest Sussex, U.K.), then 
treated with 0.01 units of Calf Intestinal Alkaline Phosphatase (Promega, 
30 Southampton, U.K.) followed by phenol/chloroform extraction and ethanol 

precipitation as described above. Precipitated DNA was dissolved in 20pl of water. 
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The cut PCR product was transferred to Streptavidin coated strips (Roche 
Diagnostics Ltd, East Sussex, U,K.) in lx TBS, 0.3 mg/ml BSA, 0.1% Tween 20 and 

— IncuDatedTfor^THi^ — 
flanking biotinylated DNA upstream and downstream of the Ncol and NotI site of the 

i PCR product and enables recovery of the small DNA fragment containing the 

selected peptide sequence. Supernatant was phenol/chloroform extracted and ethanol 
precipitated as described above. Precipitated DNA was dissolved in lOul of water. 
Cut plasmid and the isloated small DNA fragment containing the selected peptide 
sequence, both having Ncol and NotI overhangs, were ligated using a Quick ligation 

D kit according to the manufacturers instructions (New England Biolabs, Beverly, MA, 
U.S.A.) followed by phenol/chloroform extraction and ethanol precipitation as 
described above. Precipitated DNA was dissolved in lOul of water and electroporated 
into electrocompetent TGI cells according to the manufacturers instructions 
(Stratagene, U.S.A.) and selected on plates with 2xTY, lOOug/ml ampicillin, and 2% 

5 glucose. 

Anti-V5 antibody ELISA screening of selected clones. 88 colonies were 
picked into 400ul of 2x TY, 2% glucose, and 1 OOug/ml ampicillin and grown 
overnight at 37*C, shaking 300 rpm. 50ul of the overnight cultures were transferred 
into lml. of 2x TY, 2% glucose, and 1 OOug/ml ampicillin and grown at 37°C, shaking 
20 300 rpm until OD 0.5. Then the cells were centrifuged at lOOOx g for 10 minutes. 
The supernatants were discarded and pellets were resuspended in 600ul of 2x TY, 
0.4M sucrose, 1 OOug/ml ampicillin, and ImM IPTG and grown for 4 hours at 37°C, 
300 rpm. After induction the cells were centrifuged at lOOOx g for 10 minutes. 150ul 
of the supernatants were used in the ELISA test. NUNC Maxisorp plates were coated 
25 with lOOul of lug/ml in lx PBS of either anti-human kappa region antibody or anti- 
V5 antibody or 50ug/ml of BSA for 7 hours at room temperature. An additional plate 
was left blank, only coated with PBS. Wells were rinsed 2x PBS followed by 
blocking for 1 hour at room temperature with 300ul of 4% Marvel, 0.1% Tween in 
lx PBS. Wells were rinsed 2x PBS, then 150ul of supernatant and 150ul of 4% 
30 Marvel, 0.1% Tween 20 in lx PBS were added to wells and incubated for 1 hour at 
room temperature. Wells were then washed 2xPBS, 0.1% Tween 20 and 2x PBS. 
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Secondary antibody anti-human kappa region antibody conjugated to horse radish 
peroxidase (HRP)(final concentration 1.6p,g/ml) was diluted 500 times in 4% Marvel, 
0.1% Tween 20, Ix PBS and added to wells and incubated for 1 hour at room 
temperature. Wells were then washed 4x PBS, 0.1% Tween 20 and 2x PBS. The . 

5 HRP signal was detected by adding 200[i\ of TMB substrate. Reaction was stopped 
with lOOjxl of 0.5M sulphuric acid. Absorbance was read at 450nm. 35 out of 88 
clones expressed well judged by HRP signal from clones screened against anti- 
human kappa region antibody. 7 out of these 35 clones showed specific binding to 
anti-V5 antibody, thereby enriching V5-peptides from 1 in 5000 to 1 in 5, i.e. an 

1 0 enrichment factor of 1 000 (Figure 5). 
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SEQUENCE LISTING 

<110> ISOGENICA LIMITED 

<120>~ PEPTIDE LIBRARY DISPLAY METHOD 

<130> P. 86234 SER 

<160> 19 

<170> Patent In version 3.1 

<210> 1 

<211> 23 

<212> DNA • 

<213> Artificial isequence 
<220> 

<223> Primer 

<400> 1 

actgatcttc accaaacgta tta 



<210> 2 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 2 

tgcatatctg tctgtccaca gg 



<210> a 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 3 

gagcttcaac aggggagggg gaggaggatc aactgatctt caccaaac 



<210> 4 
<211> 50 
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<212> DNA ■ 

<213> Artificial sequence 

<220> . " 

<223> Primer 

<400> 4 

ctaggactgg attcaacggg gggaggagga tcaactgatc ttcaccaaac 50 

<210> 5 

<211> 49 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 



<210> 6 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 6 

tcccctgttg aagctctttg tg. 22 



<210> 7 

<211> 40 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 



<400> 5 

cagaagagga tctgaatggg ggaggagggt ccactgtggc tgcaccatc 
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<400> 7 

cagaagagga tctgaatggg ggaggagggt ccggaaaacc 



40 



<210> 8 
<211> 27 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Primer _ _ 



<400> 8 

gctacgttga atccagtcct aggagag 



<210> 9 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 9 

catattgtcg ttagaacgcg gc 



<210> 10 

<211> 58 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 10 

attcagatcc tcttctgaga tgagtttttg ttcctcgagc atggtagatc ctgtttcc 



<210> 11 
<211> 42 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> . Primer 
<400> 11 

cgatacctag cgttcggatc catattgtcg ttagaacgcg gc 



<210> 12 

<211> 1788 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> DNA construct 
<400> 12 : 

catattgtcg ttagaacgcg gctacaatta atacataacc ttatgtatca tacacatacg 60 

atttaggtga cactatagaa tacaagctta ctccccatcc ccctgttgac aattaatcat 120 

ggctcgtata atgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acaggatcta 180 

ccatgctcga ggaacaaaaa ctcatctcag aagaggatct gaatggggga ggagggtcca 240 

ctgtggctgc accatctgtc ttcatcttcc cgccatctga tgagcagttg aaatctggaa 300 

ctgcctctgt tgtgtgcctg ctgaataact tctatcccag agaggccaaa gtacagtgga 360 

aggtggataa cgccct'ccaa tcgggtaact cccaggagag tgtcacagag caggacagca 420 

aggacagcac ctacagcctc agcaacaccc tgacgctgag caaagcagac tacgagaaac 480 

acaaagtcta cgcctgcgaa gtcacccatc agggcctgag ctcgcccgtc acaaagagct 540 

tcaacagggg agggggagga ggatcaactg atcttcacca aacgtattac cgccaggtaa 600 

agaacccgaa tccggtgttc actccccgtg aaggtgccgg aacgccgaag ttccgcgaaa . 660 

aaccgatgga aaaggcggtg, ggcctcacct cccgttttga tttcgccatt catgtggcgc 720 

atgcccgttc ccgtggtctg cgtcggcgca tgccaccggt gctgcgtcga cgggctattg 780 

atgcgctgct gcaggggctg tgtttccact atgacccgct ggccaaccgc gtccagtgtt 840 

ccatcaccac actggccatt gagtgcggac tggcgacaga gtccggtgca ggaaaactct 900 

ccatcacccg tgccacccgg gccctgacgt tcctgtcaga gctgggactg attacctacc 960 

agacggaata tgacccgctt atcgggtgct. acattccgac cgacatcacg ttcacactgg 1020 

ctctgtttgc tgcccttgat gtgtctgagg atgcagtggc agctgcgcgc cgcagtcgtg 1080 

ttgaatggga aaacaaacag cgcaaaaagc aggggctgga taccctgggt atggatgagc 1140 

tgatagcgaa agcctggcgt tttgtgcgtg agcgtttccg cagttaccag acagagcttc 1200 

agtcccgtgg aataaaacgt gcccgtgcgc gtcgtgatgc gaacagagaa cgtcaggata 1260 

tcgtcaccct agtgaaacgg cagctgacgc gtgaaatctc ggaaggacgc ttcactgcta 1320 

atggtgaggc ggtaaaacgc gaagtggagc gtcgtgtgaa ggagcgcatg attctgtcac 1380 

gtaaccgcaa ttacagccgg ctggccacag cttctccctg aaagtgatct cctcagaata 1440 

atccggcctg cgccggaggc atccgcacgc ctgaagcccg ccggtgcaca aaaaaacagc 1500 

gtcgcatgca aaaaacaatc tcatcatcca ccttctggag catccgattc cccctgtttt 1560 

taatacaaaa tacgcctcag cgacggggaa ttttgcttat ccacatttaa ctgcaaggga 1620 

cttccccata aggttacaac cgttcatgtc ataaagcgcc agccgccagt cttacagggt 1680 

gcaatgtatc ttttaaacac ctgtttatat ctcctttaaa ctacttaatt acattcattt 1740 

aaaaagaaaa cctattcact gcctgtcctg tggacagaca gatatgca 1788 



<210> 13 
<211> 1518 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> DNA construct 
<400> 13 

catattgtcg ttagaacgcg gctacaatta atacataacc ttatgtatca tacacatacg 60 
atttaggtga cactatagaa tacaagctta ctccccatcc ccctgttgac aattaatcat 120 
ggctcgtata atgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acaggatcta 180 
ccatgctcga ggaacaaaaa ctcatctcag aagaggatct gaatggggga ggagggtccg 240 
gaaaacctat cccaaaccct ctcctaggac tggattcaac ggggggagga ggatcaactg 300 



ft 



40 




atcttcacca aacgtattac cgccaggtaa agaacccgaa tccggtgttc actccccgtg 360 

aaggtgccgg aacgccgaag ttccgcgaaa aaccgatgga aaaggcggtg ggcctcacct 420 

cccgttttga tttcgccatt catgtggcgc atgcccgttc ccgtggtctg cgtcggcgca 480 

tgccaccggt gctgcgtcga cgg gctattg atgcgctgct gcaggggctg tgtttccact 540 

atgacccgct ggccaaccgc gtccagtgtt ccatcaccac actggccatt gagtgcggac 600 

tggcgacaga gtccggtgca ggaaaactct ccatcacccg tgccacccgg gccctgacgt 660 

tcctgtcaga gctgggactg attacctacc agacggaata tgacccgctt atcgggtgct 720 

acattccgac cgacatcacg ttcacactgg ctctgtttgc tgcccttgat gtgtctgagg 780 

atgcagtggc agctgcgcgc cgcagtcgtg ttgaatggga aaacaaacag cgcaaaaagc 840 

aggggctgga taccctgggt atggatgagc tgatagcgaa agcctggcgt tttgtgcgtg 900 

agcgtttccg cagttaccag acagagcttc agtcccgtgg aataaaacgt gcccgtgcgc 960 

gtcgtgatgc gaacagagaa cgtcaggata tcgtcaccct agtgaaacgg cagctgacgc 1020 

gtgaaatctc ggaaggacgc ttcactgcta atggtgaggc ggtaaaacgc gaagtggagc 1080 

gtcgtgtgaa ggagcgcatg attctgtcac gtaaccgcaa ttacagccgg ctggccacag 1140 

cttctccctg aaagtgatct cctcagaata atccggcctg cgccggaggc atccgcacgc 1200 

ctgaagcccg ccggtgcaca aaaaaacagc gtcgcatgca aaaaacaatc tcatcatcca 1260 

ecttctggag catccgattc cccctgtttt taatacaaaa tacgcctcag cgacggggaa 1320 

ttttgcttat ccacatttaa ctgcaaggga cttccccata aggttacaac cgttcatgtc 1380 

ataaagcgcc agccgccagt cttacagggt gcaatgtatc ttttaaacac ctgtttatat 1440 

ctcctttaaa ctacttaatt acattcattt aaaaagaaaa cctattcact gcctgtcctg 1500 

tggacagaca gatatgca 1518 



<210> 14 

<211> 38 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Estrogen Receptor Target Recognition Sequence 



<210> 15 

<211> 828 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> repA sequence 
<220> 

<221> CDS 

<222> (D..C828) 
<223> 



<400> 14 

tcaggtcaga gtgacctgag ctaaaataac acattcag 



38 



<400> 15 

atg gta aag aac ccg aat ccg gtg ttc act ccc.cgt gaa ggt gcc gga 



48 



41 

Met Val Lys AsrTPro Asn Pro Val Phe Thr Pro Arg Glu Gly Ala Gly 
15 10 15 

acg ccg aag ttc cgc gaa aaa ccg atg gaa aag gcg gtg ggc etc acc 
Thr Pro Lys Phe Arg Glu Lys Pro Met Glu Lys Ala Val Gly Leu Thr 
20 . 25 30 



96 



tec cgt ttt gat ttc gee att cat gtg gcg cat gec cgt tec cgt ggt 144 
Ser Arg Phe Asp Phe Ala He His Val Ala His Ala Arg Ser Arg Gly 
35 40 45 

ctg cgt egg cgc atg cca ccg gtg ctg cgt cga egg get att gat gcg 192 
Leu Arg Arg Arg Met Pro Pro Val Leu Arg Arg Arg Ala lie Asp Ala 
50 55 60 



ctg ctg cag ggg ctg tgt ttc cac tat gac ccg ctg gec aac cgc gtc 
Leu Leu Gin Gly Leu Cys Phe His Tyr Asp Pro Leu Ala Asn Arg Val 
65 70 75 80 



240 



cag tgt tec ate acc aca ctg gee att gag tgc gga ctg gcg aca gag 
Gin Cys Ser lie Thr Thr Leu Ala lie Glu Cys Gly Leu Ala Thr Glu 
85 90 95 



288 



tec ggt gca gga aaa etc tec ate acc cgt gee acc egg gec ctg acg 
Ser Gly Ala Gly Lys Leu Ser lie Thr Arg Ala Thr Arg Ala Leu Thr 
100 105 110 



. 336 



ttc ctg tea gag ctg gga ctg att acc tac cag acg gaa tat gac ccg 384 

Phe Leu Ser Glu Leu Gly Leu He Thr Tyr Gin Thr Glu Tyr Asp Pro 
115 120 .125 

ctt ate ggg tgc tac att ccg acc gac ate acg ttc aca ctg get ctg 432 

Leu He Gly Cys Tyr He Pro Thr Asp He Thr Phe Thr Leu Ala Leu 
130 135 140 



ttt get gec ctt gat gtg tct gag gat gca gtg gca get gcg cgc cgc 
Phe Ala Ala Leu Asp Val Ser Glu Asp Ala Val Ala Ala Ala Arg Arg 
145 150 155 160 



480 



agt cgt gtt gaa tgg gaa aac aaa cag cgc aaa aag cag ggg ctg gat 
Ser Arg Val Glu Trp Glu Asn Lys Gin Arg Lys Lys Gin Gly Leu Asp 
165 170 175 



528 



acc ctg ggt atg gat gag ctg ata gcg aaa gee tgg cgt ttt gtg cgt 
Thr Leu Gly Met Asp Glu Leu He Ala Lys Ala Trp Arg Phe Val Arg 
180 185 190 



576 



gag cgt ttc cgc agt tac cag aca gag ctt cag tec cgt gga ata aaa 
Glu Arg Phe Arg Ser Tyr Gin Thr Glu Leu Gin Ser Arg Gly lie Lys 
195 200 205 



624 
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cgt gcc cgt gcg cgt cgt gat gcg aac aga gaa cgt cag gat ate gtc 
Arg Ala Arg Ala Arg Arg Asp Ala Asn Arg Glu Arg Gin Asp He Val 

210 215 220 



acc eta gtg aaa egg cag ctg acg cgt gaa ate teg gaa gga cgc ttc 
Thr Leu Val Lys Arg Gin Leu Thr Arg Glu He Ser Glu Gly Arg Phe 
225 230 235 240 

act get aat ggt gag gcg gta aaa cgc gaa gtg gag cgt cgt gtg aag 
Thr Ala Asn Gly Glu Ala Val Lys Arg Glu Val Glu Arg Arg Val Lys 
245 250 255 

gag cgc atg att ctg tea cgt aac cgc aat tac age egg ctg gcc aca 
Glu Arg Met lie Leu Ser Arg Asn Arg Asn Tyr Ser Arg Leu Ala Thr 
260 265 270 

get tct ccc tga 
Ala Ser Pro 
275 



672 



720 



768 



816 



828 



<210> 16 
<211> 275 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> repA sequence 
<400> 16 

Met Val Lys Asn Pro Asn Pro Val Phe Thr Pro Arg Glu Gly Ala Gly 
15 io 15 

Thr Pro Lys Phe Arg Glu Lys Pro Met Glu Lys Ala Val Gly Leu Thr 
20 25 30 

Ser Arg Phe Asp Phe Ala He His Val Ala His Ala Arg Ser Arg Gly: 
35 40 45 

Leu Arg Arg Arg Met Pro Pro Val Leu Arg Arg Arg Ala He Asp Ala 
50 55 60 

Leu Leu Gin Gly Leu Cys Phe His Tyr Asp Pro Leu Ala Asn Arg Val . 
65 70 75. 80 

Gin Cys Ser He Thr Thr Leu Ala lie Glu Cys Gly Leu Ala Thr Glu 
85 90 95 
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Ser Gly Ala Gly Lys Leu Ser lie Thr Arg Ala Thr Arg Ala Leu Thr 
100 105 110 

Phe Leu Ser Glu Leu Gly Leu He Thr Tyr Gin Thr Glu Tyr Asp Pro 
115 120 125 

Leu He Gly Cys Tyr He Pro Thr Asp He Thr Phe Thr Leu Ala Leu 
130 135 140 

Phe Ala Ala Leu Asp Val Ser Glu Asp Ala Val Ala Ala Ala Arg Arg 
145 150 155 160 

Ser Arg Val Glu Trp Glu Asn Lys Gin Arg Lys Lys Gin Gly Leu Asp 
165 170 . 175 

Thr Leu Gly Met Asp Glu Leu He Ala Lys Ala Trp Arg Phe Val Arg 
180 185 190 

Glu Arg Phe Arg Ser. Tyr Gin Thr Glu Leu Gin Ser Arg Gly lie Lys 
195 200 205 

Arg Ala Arg Ala Arg. Arg Asp Ala Asn Arg Glu Arg Gin Asp lie Val 
210 215 220 . 

Thr Leu Val Lys Arg Gin Leu Thr Arg Glu He Ser Glu Gly Arg Phe 
225 230 235 240 

Thr Ala Asn Gly Glu Ala Val Lys" Arg Glu Val Glu Arg Arg Val Lys 
245 250 255 

Glu Arg Met He Leu Ser Arg Asn Arg Asn Tyr Ser Arg Leu Ala Thr 
260 265 270 

Ala Ser Pro 
275 



<210> 17 
<211> 172 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> CIS DNA element 
<400> 17 

aagtgatctc cteagaataa tccggcctgc gccggaggca tccgcacgcc tgaagcccgc 60 
cggtgcacaa aaaaacagcg tcgcatgcaa aaaacaatct catcatccac cttctggagc 120 
atccgattcc ccctgttttt aatacaaaat acgcctcagc gacggggaat tt 172 
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<210>- 18 
<211> 195 

<212>_DNA_ ' 

<213> Artificial sequence 

<220> 

<223> ori sequence 
<400> 18 

tgcttatcca catttaactg caagggactt ccccataagg ttacaaccgt tcatgtcata 
aagcgccagc cgccagtctt acagggtgca atgtatcttt taaacacctg tttatatctc 
ctttaaacta cttaattaca ttcatttaaa aagaaaacct attcactgcc tgtcctgtgg 
acagacagat atgca 



19 
20 
DNA 

Artificial sequence 
<220> 

<223> Primer 
<400> 19 

aattccccgt cgctgaggcg 



<210> 20 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 20 

cgtaagccgg tactgattga 



<210> 21 

<211> 110 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 21 

cacaggaaac aggatctacc atggccggaa aacctatccc aaaccctctc ctaggactgg 



60 
120 
180 
195 



<210> 
<211> 
<212> 
<213> 
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attcaacggg gggaggagga tcagcggccg caactgatct tcaccaaacg 



<210> 22 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 
<220> 

<221> misc_feature 

<222> (29).. (30) 

<223> n = a, g, c or t 

<220> 

<221> misc_feature 

<222> (32).. (33) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (35).. (36) 

<223> n - a, g. c or t 

<220> 

<221> misc_feature 

<222> (38).. (39) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (41).. (42) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (44) . . (45) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (47) . . (48) 

<223> n = a. g, c or t 

<220> 

<221> mi sc_f eature 

<222> (50) . . (51) 

<223> n = a, g. c or t 
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<220> 

<221> misc_feature 

<2222l_jL53J__(5.4) : i - - 1 

<223> n = a. g. c or t 

<220> 

<221> misc_feature. 

<222> (56).. (57) 

<223> n = a. g. c or t 

<220> 

<221> ra1sc_feature 

<222> (59).. (60) 

<223> n = a. g, c or t 

<220> 

<221> raisc_feature 

<222> (62).. (63) 

<223> n = a. g, c or t 

<400> 22 

cacacaggaa acaggatcta ccatggccnn bnnbnnbnnb nnbnnbnnbn nbnnbnnbnn 
bnnbggggga ggaggatcag cggccgcaac tgatcttcac caaacg 



<210> 23 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 23 

cagttgatcg gcgcgagatt 

<210>. 24 
<211> 2390 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> TAC-V5-REPA-CIS-ORI-408 construct 
<400> 24 

cagttgatcg gcgcgagatt taatcgccgc gacaatttgc gacggcgcgt gcagggccag 
actggaggtg gcaacgccaa tcagcaacga ctgtttgccc gccagttgtt gtgccacgcg 
gttgggaatg taattcagct ccgccatcgc cgcttccact ttttcccgcg ttttcgcaga 
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aacgtggctg gcctggttca ccacgcggga aacggtctga taagagacac cggcatactc 240 

tgcgacatcg tataacgtta ctggtttcac attcaccacc ctgaattgac tctcttccgg 300 

gcgctatcat gccataccgc gaaaggtttt gcaccattcg gctagcgatg accctgctga. 360 

ttggttcgct gaccatttcc ggggtgcgga acggcgttac cagaaactca gaaggttcgt 420 

ccaaccaaac cgactctgac ggcagtttac gagagagatg atagggtctg cttcagtaag 480 

ccagatgcta cacaattagg cttgtacata ttgtcgttag aacgcggcta caattaatac 540 

ataaccttat gtatcataca catacgattt aggtgacact atagaataca agcttactcc 600 

ccatccccct gttgacaatt aatcatggct cgtataatgt gtggaattgt gagcggataa 660 

caatttcaca caggaaacag gatctaccat ggccggaaaa cctatcccaa accctctcct 720 

aggactggat tcaacggggg gaggaggatc agcggccgca actgatcttc accaaacgta 780 

ttaccgccag gtaaagaacc cgaatccggt gtttacaccc cgtgaaggtg caggaacgct 840 
gaagttctgc gaaaaactga tggaaaaggc ggtgggcttc acttcccgtt ttgatttcgc 900 
cattcatgtg gcgcatgccc gttcgcgtgg tctgcgtcga cgcatgccac cagtgctgcg 960 

tcgacgggct attgatgcgc tcctgcaggg gctgtgtttc cactatgacc cgctggccaa 1020 

ccgcgtccag tgctccatca ccacgctggc cattgagtgc ggactggcga cggagtctgc 1080 

tgccggaaaa ctctccatca cccgtgccac ccgggccctg acgttcctgt cagagctggg 1140 

actgattacc taccagacgg aatatgaccc gcttatcggg tgctacattc cgaccgatat 1200 

cacgttcaca tctgcactgt ttgctgccct cgatgtatca gaggaggcag tggccgccgc 1260 

gcgccgcagc cgtgtggtat gggaaaacaa acaacgcaaa aagcaggggc tggataccct 1320 

gggcatggat gaactgatag cgaaagcctg gcgttttgtt cgtgagcgtt ttcgcagtta 1380 

tcagacagag.cttaagtccc ggggaataaa gcgtgcccgt gcgcgtcgtg atgcggacag 1440 

ggaacgtcag gatattgtca ccctggtgaa acggcagctg acgcgcgaaa tcgcggaagg 1500 

gcgcttcact gccaatcgtg aggcggtaaa acgcgaagtt gagcgtcgtg tgaaggagcg 1560 

catgattctg tcacgtaacc gtaattacag ccggctggcc acagcttccc cctgaaagtg 1620 

acctcctctg aataatccgg cctgcgccgg aggcttccgc acgtctgaag cccgacagcg 1680 

cacaaaaaat cagcaccaca tacaaaaaac aacctcatca tccagcttct ggtgcatccg 1740 

gccccccctg ttttcgatac aaaacacgcc tcacagacgg ggaattttgc ttatccacat 1800 
taaactgcaa gggacttccc cataaggtta caaccgttca tgtcataaag cgccatccgc 1860 
cagcgttaca gggtgcaatg tatcttttaa acacctgttt atatctcctt taaactactt 1920 
aattacattc atttaaaaag aaaacctatt cactgcctgt cctgtggaca gacagatatg 1980 
cacctcccac cgcaagcggc gggcccctac cggagccgct ttagttacaa cactcagaca 2040 
caaccaccag aaaaaccccg gtccagcgca gaactgaaac cacaaagccc ctccctcata 2100 
actgaaaagc ggccccgccc cggcccgaag ggccggaaca gagtcgcttt taattatgaa 2160 
tgttgtaact acttcatcat cgctgtcagt cttctcgctg gaagttctca gtacacgctc 2220 
gtaagcggcc ctgacggccc gctaacgcgg agatacgccc cgacttcggg taaaccctcg 2280 
tcgggaccac tccgaccgcg cacagaagct ctctcatggc tgaaagcggg tatggtctgg 2340 
cagggctggg gatgggtaag gtgaaatcta tcaatcagta ccggcttacg 2390 



<210> 25 

<211> 2384 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> TAC-NNB-REPA-CIS-ORI-408 construct 



<220> 
<221> 
<222> 



mi sc_f eature 
(695) . . (696) 



<223> n = a, g r c or t 
<220> . 

~<22~1>— -mi-sGfPeat-upe 

<222> (698).. (699) 

<223> n = a f g, c or t 

<220> 

<221> miscjfeature 

<222> (701).. (702) 

<223> n = a, g, c or t 

<220> 

<221> toisc_f eature 

<222> (704).. (705) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (707).. (708) 

<223> n « a. g, c or t 

<220> 

<221> mi scjF eature 

<222> (710).. (711) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (713).. (714) 

<223> n = a, g, c or t 

<220> 

<22.1> mi sc_f eature 

<222> (716).. (717) 

<223> n = a, g. c or t 

<220> 

<221> mi sc_f eature 

<222> (719).. (720) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (722).. (723) 

<223> n = a. g, c or t 

<220> 

<221> mi sc_f eature 

<222> (725).. (726) 
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<223> n = a. g, c or t 
<220> 

<221> misc_feature 
<222> (728).. (729) 
<223> n - a, g. c or t 

<400> 25 

cagttgatcg gcgcgagatt taatcgccgc gacaatttgc gacggcgcgt gcagggccag 60 

actggaggtg gcaacgccaa tcagcaacga ctgtttgccc gccagttgtt gtgccacgcg 120 

gttgggaatg taattcagct ccgccatcgc cgcttccact ttttcccgcg ttttcgcaga 180 

aacgtggctg gcctggttca ccacgcggga aacggtctga taagagacac cggcatactc 240 

tgcgacatcg tataacgtta ctggtttcac attcaccacc ctgaattgac tctcttccgg 300 

gcgctatcat gccataccgc gaaaggtttt gcaccattcg gctagcgatg accctgctga 360 

ttggttcgct gaccatttcc ggggtgcgga acggcgttac cagaaactca gaaggttcgt 420 

ccaaccaaac cgactctgac ggcagtttac gagagagatg atagggtctg cttcagtaag 480 

ccagatgcta cacaattagg cttgtacata ttgtcgttag aacgcggcta caattaatac 540 

ataaccttat gtatcataca catacgattt aggtgacact atagaataca agcttactcc 600 

ccatccccct gttgacaatt aatcatggct cgtataatgt gtggaattgt gagcggataa 660 

caatttcaca caggaaacag gatctaccat ggccnnbnnb nnbnnbnnbn nbnnbnnbnn 720 

bnnbnnbnnb gggggaggag gatcagcggc cgcaactgat cttcaccaaa cgtattaccg 780 

ccaggtaaag aacccgaatc cggtgtttac accccgtgaa ggtgcaggaa cgctgaagtt 840 

ctgcgaaaaa ctgatggaaa aggcggtggg cttcacttcc cgttttgatt tcgccattca 900 

tgtggcgcat gcccgttcgc gtggtctgcg tcgacgcatg ccaccagtgc tgcgtcgacg 960 

ggctattgat gcgctcctgc aggggctgtg tttccactat gacccgctgg ccaaccgcgt 1020 

ccagtgctcc atcaccacgc tggccattga gtgcggactg gcgacggagt ctgctgccgg 1080 

aaaactctcc atcacccgtg ccacccgggc cctgacgttc ctgtcagagc tgggactgat 1140 

tacctaccag acggaatatg acccgcttat cgggtgctac attccgaccg atatcacgtt 1200 

cacatctgca ctgtttgctg ccctcgatgt atcagaggag gcagtggccg ccgcgcgccg 1260 

cagccgtgtg gtatgggaaa acaaacaacg caaaaagcag gggctggata ccctgggcat 1320 

ggatgaactg atagcgaaag cctggcgttt tgttcgtgag cgttttcgca gttatcagac 1380 

agagcttaag tcccggggaa taaagcgtgc ccgtgcgcgt cgtgatgcgg acagggaacg 1440 

tcaggatatt gtcaccctgg tgaaacggca gctgacgcgc gaaatcgcgg aagggcgctt 1500 

cactgccaat cgtgaggcgg taaaacgcga agttgagcgt cgtgtgaagg agcgcatgat 1560 

tctgtcacgt aaccgtaatt acagccggct ggccacagct tccccctgaa agtgacctcc 1620 

tctgaataat ccggcctgcg ccggaggctt ccgcacgtct gaagcccgac agcgcacaaa 1680 

aaatcagcac cacatacaaa aaacaacctc atcatccagc ttctggtgca tccggccccc 1740 

cctgttttcg atacaaaaca cgcctcacag acggggaatt ttgcttatcc acattaaact 1800 

gcaagggact tccccataag gttacaaccg ttcatgtcat aaagcgccat ccgccagcgt 1860 

tacagggtgc aatgtatctt ttaaacacct gtttatatct cctttaaact acttaattac 1920 

attcatttaa aaagaaaacc tattcactgc ctgtcctgtg gacagacaga tatgcacctc 1980 

ccaccgcaag cggcgggccc ctaccggagc cgctttagtt acaacactca gacacaacca 2040 

ccagaaaaac cccggtccag cgcagaactg aaaccacaaa gcccctccct cataactgaa 2100 

aagcggcccc gccccggccc gaagggccgg aacagagtcg cttttaatta tgaatgttgt 2160 

aactacttca tcatcgctgt cagtcttctc gctggaagtt ctcagtacac gctcgtaagc 2220 

ggccctgacg gcccgctaac gcggagatac gccccgactt cgggtaaacc ctcgtcggga 2280 

ccactccgac cgcgcacaga agctctctca tggctgaaag cgggtatggt ctggcagggc 2340 
tggggatggg taaggtgaaa tctatcaatc agtaccggct tacg 2384 



<210> 26 
<211> 26 
<212> DNA 
^213>_^i±iiic.tal.^iequence 

<220> 

<223> Primer 
<400> 26 

ccccatcccc ctgttgacaa ttaatc 

<210> 27 

<211> 22 

<212> DNA . 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 27 

ggtagatcct gtttcctgtg tg 
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CLAIMS 

1 . A method for producing an in vitro peptide expression library 
comprising a plurality of peptides, wherein each peptide is linked to the DNA 
construct encoding the peptide, comprising the steps of: 

(a) providing a DNA construct comprising: 

(i) a DNA target sequence; 

(ii) DNA encoding a library member peptide; and 

(iii) DNA encoding a peptide capable of non-covalently binding 
directly or indirectly to said DNA target sequence of (i); 

wherein said DNA construct and encoded protein are selected to have 
cis-activity 

(b) expressing a plurality of DNA constructs according to (a) wherein said 
DNA constructs encode a plurality of library member peptides such 
that each expressed peptide is non-covalently linked to the DNA from 
which it was produced. 

2. A method according to claim 1 wherein said DNA construct further 
comprises: (iv) a cis-acting DNA element. 

3. A method according to claim 2 wherein said DNA construct of (a) 

further comprises 

(v) DNA encoding a fragment comprising at least the C-terrninal 
20 amino acids of a repA protein wherein said fragment is 
capable of interacting with said cis-acting DNA element of 
(iv); 

wherein said cis-acting DNA element of (iv) is located 3' to said DNA of (ii), (iii) 
and (v). 
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4. A method according to any one of the preceding claims wherein the 
peptide encoded hy said DNA of (iii) is capable of recognising and directly binding 

said~DNA-target-sequenee-of-(i); 

5. A method according to claim 4 wherein the peptide encoded by said 
DNA of (iii) is a repA protein and wherein said DNA target sequence of (i) is ori. 

6. A method according to any one of claims 3 to 5 wherein said repA is 
selected from repA of the IncI complex plasmids and repA of the IncF, IncB, IncK, 
IncZ and IncL/M plasmids. 

7. A method according to claim 5 wherein said DNA construct 
comprises the sequence encoding rep A, the cis DNA element and the ori DNA of the 
IncFEplasmidRl. 

8. A method according to any one of the preceding claims wherein said 
repA protein has the sequence given in SEQ ID NO: 16 and wherein said cis DNA 
element has the sequence given in SEQ ID NO: 17. 

9. A method according to claim 4 wherein the peptide encoded by said 
DNA of (iii) is an oestrogen receptor DNA binding domain and wherein said DNA 
target sequence of (i) is an oestrogen receptor target sequence. 

10. A method according to claim 8 wherein said DNA binding domain 
comprises amino acids 176 to 282 of the oestrogen receptor DNA binding fragment 
and wherein said DNA target sequence comprises the oestrogen receptor target 
sequence given in SEQ ID NO: 14. 

11. A method according to claim 1 , 2 or 3 wherein the peptide encoded by 
said DNA of (iii) indirectly binds said DNA target sequence of (i) via a Afunctional 
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agent, one part of which binds said DNA target sequence of (i) and a second part of 
which binds the peptide encoded by said DNA of (Hi). 

12. A method according to claim 1 1 wherein said DNA target sequence 
comprises a DNA tag capable of being bound by said bifunctional agent, said tag 
being optionally selected from biotin and fluorescein. 

13. A method according to claim 1 1 or 12 wherein the binding activities 
of said bifunctional agent are conferred by means of two antibodies or fragments 
thereof. 

14. A method according to claim 1 3 wherein one or both of said binding 
activities are conferred by means of an Fab fragment. 

.15. A method according to any one of claims 1 1 to 14 wherein said 
bifunctional agent is provided prior to step (b). 

16. A method according to. claim 1 1 wherein said bifunctional agent is 
bound to said DNA target sequence of (i) and is capable of binding to the peptide 
encoded by said DNA of (Hi). 

17. A method according to claim 16 wherein said bifunctional agent is a 
polymer. 

1 8. A method according to any one of the preceding claims wherein said 
DNA is under the control of suitable promoter.and translation sequences to allow for 
in vitro transcription and translation. 



19. A method according to any one of the preceding claims wherein 
library member peptide is an antibody or fragment thereof. 
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20. A method according to any one of the preceding claims wherein said 
library comprises at least 10 4 molecules. 



21 . A method according to any one of the preceding claims wherein said 
expression is carried out in a coupled bacterial transcription/translation environment. 

22. A method according to claim 21 wherein said coupled bacterial 
transcription/translation environment is the S30 extract system. 

23 . A method for producing an in vitro peptide expression library 
comprising a plurality of peptides, wherein each peptide is linked to- the DNA 
construct encoding the peptide, comprising the steps of: 

(a) providing a DNA construct comprising: 

(i) DNA encoding a library member peptide; and 

(ii) DNA encoding a peptide capable of binding to a bifunctional 
agent; 

wherein said DNA construct and encoded protein are selected to have 
cis-activity; 

(b) binding a bifunctional agent or a DNA tag capable of binding a 
bifunctional agent to said DNA construct of (a), wherein said 
bifunctional agent is capable of binding to the peptide encoded by said 
DNA of (ii); and 

(c) expressing a plurality of DNA constructs according to (b), wherein 
said DNA constructs encode a plurality of library member peptides 
such that each expressed peptide is linked via said bifunctional agent 
to the DNA from which it was produced. 

24. A method of identifying and/or purifying a peptide exhibiting desired 
properties from an in vitro peptide expression library produced according to the 
method of any one of the preceding claims, comprising at least the steps of (a) 
screening said library and (b). selecting and isolating the relevant library member. 
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25 . A method of identifying a specific ligand binding peptide, said 
method comprising at least the steps of (a) screening an in vitro peptide expression 
library produced according to the method of any one of claims 1 to 23 with ligand 
molecules which are optionally bound to a solid support; (b) selecting and isolating a 
library member binding to said ligand molecule; and (c) isolating the peptide which 
binds specifically to said ligand molecule. 

26 A method according to claim 24 or 25 wherein said library member 
peptides are antibodies or fragments thereof. 

27. A method of identifying and/or purifying a peptide having the ability 
to bind a specific DNA target sequence comprising at least the steps of 

(a) providing an in vitro expression library according to any one of claims 
1 to 23 wherein the peptide encoded by the DNA of (iii) is a library 
member peptide having DNA binding activity and wherein said DNA 
target sequence of (i) is the target sequence of interest; 

(b) selecting and isolating a library member in which the encoded protein 
binds to said target sequence; and 

(c) isolating the peptide which binds to said target sequence. 

28. A method according to claim 27 wherein said library member peptide; 
are zinc finger proteins, helix-loop-helix proteins or helix-turn-helix proteins. 

29. A method according to any one of claims 24 to 28 wherein 
additionally the DNA expressing said isolated peptide is isolated. 

30. An in vitro peptide expression library produced according to the 
method of any one of claims 1 to 23 . 




31. A DNA construct as described in any one of claims 1 to 23 . 
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DISPLAY LIBRARY 

Abstract : ~ ■ 

The invention provides a method for making in vitro peptide expression 
libraries, and for the isolation of nucleotide sequences encoding peptides of interest, 
wherein the peptides or proteins are specifically associated with the DNA encoding 
them through non-covalent protein:DNA binding. The method describes ways of 
making the library itself, DNA molecules encoding the library and uses of the 
expression library. 
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